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RNA DETECTION ASSAYS 

The present application is a continuation-in-part of U.S. Patent Application No. 
09/864,636, and is a continuation-in-part of Patent Application No. 09/864,426, each of 
which is a continutation-in- part of U.S. Patent AppUcation No. 09/577,304 and 
09/758,282 and is a continuation-in-part of U.S. Patent AppHcation No. 09/350,309, 
which is a divisional appUcation U.S. Patent No. 5,985,557, and is also a continuation-in- 
part of co-pending application serial no. 09/381,212 which is a national entry of PCT 
application no. US 98/05809, which claims priority to U.S. Patent Nos, 5,994,069, 
6,090,543, 5,985,557, 6,001,567, and 5,846,717 and PCT application no. US 97/01072. 

FIELD OF THE INVENTION 

The present invention relates to compositions and methods for the detection and 
characterization of nucleic acid sequences and variations in nucleic acid sequences. The 
present invention relates to methods for forming a nucleic acid cleavage structure on a 
target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. 
For example, in some embodiments, the 5' nuclease activity of a variety of enzymes is 
used to cleave the target-dependent cleavage structure, thereby indicating the presence 
ofspecific nucleic acid sequences or specific variations thereof. 

BACKGROUND OF THE INVENTION 

Methods for the detection and characterization ofspecific nucleic acid sequences 
and sequence variations have been used to detect the presence of viral or bacterial nucleic 
acid sequences indicative of an infection, to detect the presence of variants or alleles of 

genes associated with disease and cancers. These methods also find application in the 
identification of sources of nucleic acids, as for forensic analysis or for patemity 
determinations. 

Various methods are known to the art that maybe used to detect and characterize 
specific nucleic acid sequences and sequence variants. Nonetheless, with the completion 
of the nucleic acid sequencing of the human genome, as well as the genomes of numerous 
pathogenic organisms, the demand for fast, rehable, cost-effective and user-friendly tests 



for the detection of specific nucleic acid sequences continues to grow. Importantly, these 
tests must be able to create a detectable signal from samples that contain very few copies 
of the sequence of interest. The following discussion examines two levels of nucleic acid 
detection assays currently in use: 1. Signal Amplification Technology for detection of 
5 rare sequences II. Direct Detection Technology for quantitative detection of sequences, 
and IIL Direct Detection of RNA. 

I. Signal Amplification Technology Methods For Amplification 

The "Polymerase Chain Reaction" (PGR) comprises the first generation of 
10 methods for nucleic acid ampUfication, However, several other methods have been 
developed that employ the same basis of specificity, but create signal by different 
□ amplification mechanisms. These methods include the "Ligase Chain Reaction" (LCR), 
S "Self^Sustained Synthetic Reaction" (3SR/NASBA), and "QP-Replicase" (Qp). 

W4 

i|| 15 Polymerase Chain Reaction (PGR) 

Q The polymerase chain reaction (PGR), as described in U.S. Patent Nos. 4,683,195, 

!J 4,683,202, and 4,965,188 to Mulhs and MuUis et al (the disclosures of which are hereby 

\ if 

CB incorporated by reference), describe a method for increasing the concentration of a 

% segment of target sequence in a mixture of genomic DNA without cloning or purification. 

't 

20 This technology provides one approach to the problems of low target sequence 

concentration. PGR can be used to directly increase the concentration of the target to an 
easily detectable level. This process for amplifying the target sequence involves 
introducing a molar excess of two ohgonucleotide primers that are complementary to 
their respective strands of the double-stranded target sequence to the DNA mixture 

25 containing the desired target sequence. The mixture is denatured and then allowed to 
hybridize. Following hybridization, the primers are extended with polymerase so as to 
form complementary strands. The steps of denaturation, hybridization, and polymerase 
extension can be repeated as often as needed, in order to obtain relatively high 
concentrations of a segment of the desired target sequence. 
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The length of the segment of the desired target sequence is determined by the 
relative positions of the primers with respect to each other, and, therefore, this length is a 
controllable par^eter. Because the desired segments of the target sequence become the 
dominant sequences (in terms of concentration) in the mixture, they are said to be 
5 "PCR-amplified." 

Ligase Chain Reaction (LCR or LAR) 

The hgase chain reaction (LCR; sometimes referred to as "Ligase Amplification 
Reaction" (LAR) described by Barany, Proc. Natl. Acad. Sci., 88:189 (1991); Barany, 
PCR Methods and Applic, 1 :5 (1991); and Wu and Wallace, Genomics 4:560 (1989) has 
10 developed into a well-recognized alternative method for amplifying nucleic acids. Li 
O LCR, four oHgomicleotides, two adjacent oUgonucleotides that uniquely hybridize to one 
% strand of target DNA, and a complementary set of adj acent oligonucleotides, that 
Pf hybridize to the opposite strand are mixed and DNA ligase is added to the mixture. 
4i Provided that there is complete complementarity at the junction, ligase will covalently 

15 link each set of hybridized molecules. Importantly, in LCR, two probes are Ugated 
[I together only when they base-pair with sequences in the target sample, without gaps or 
jji mismatches. Repeated cycles of denaturation, hybridization and ligation amplify a short 
y segment of DNA. LCR has also been used in combination with PCR to achieve enhanced 
detection of single-base changes. Segev, PCT Public. No. W09001 069 Al (1 990). 
20 However, because the four oligonucleotides used in this assay can pair to form two short 
Ugatable fragments, there is the potential for the generation of target-independent 
background signal. The use of LCR for mutant screening is lunited to the examination of 
specific nucleic acid positions. 

Self-Sustained Synthetic Reaction (3SR/NASBA) 

25 The self-sustained sequence replication reaction (3SR) (GuatelU et al, Proc. Natl. 

Acad. Sci., 87:1874-1878 [1990], with an erratum at Proc. Natl. Acad. Sci., %1:1191 
[1990]) is a transcription-based in vitro ampUfication system (Kwok et al, Proc. Natl. 
Acad. Sci., 86:1173-1177 [1989]) that can exponentially amplify RNA sequences at a 
uniform temperature. The amplified RNA can then be utiUzed for mutation detection 

30 (Fahy et al, PCR Meth. AppL, 1:25 [1991]). hi this method, an oligonucleotide primer is 
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used to add a phage RNA polymerase promoter to the 5' end of the sequence of interest. 
In a cocktail of enzymes and substrates that includes a second primer, reverse 
transcriptase, RNase H, RNA polymerase and ribo-and deoxyribonucleoside 
triphosphates, the target sequence undergoes repeated rounds of transcription, cDNA 
synthesis and second-strand synthesis to amplify the area of interest. The use of 3SR to 
detect mutations is kinetically limited to screening small segments of DNA (e.g., 200-300 
base pairs). 

Q-Beta (QP) Replicase 

In this method, a probe that recognizes the sequence of interest is attached to the 
replicatable RNA template for Qp replicase. A previously identified major problem with 
false positives resulting from the replication of unhybridized probes has been addressed 
through use of a sequence-specific ligation step. However, available thermostable DNA 
ligases are not effective on this RNA substrate, so the ligation must be performed by T4 
DNA ligase at low temperatures (37°C). This prevents the use of high temperature as a 
means of achieving specificity as in the LCR, the Ugation event can be used to detect a 
mutation at the junction site, but not elsewhere. 

Table 2 below, Usts some of the features desirable for systems useful in sensitive 
nucleic acid diagnostics, and summarizes the abilities of each of the major amphfication 
methods (See also, Landgren, Trends in Genetics 9:199 [1993]). 

A successful diagnostic method must be very specific. A straight-forward method 
of controlling the specificity of nucleic acid hybridization is by controlling the 
temperature of the reaction. While the 3SR/NASBA, and QP systems are all able to 
generate a large quantity of signal, one or more of the enzymes involved in each caimot 
be used at high temperature (i.e., >55°C). Therefore the reaction temperatures cannot be 
raised to prevent non-specific hybridization of the probes. If probes are shortened in 
order to make them melt more easily at low temperatures, the hkelihood of having more 
than one perfect match in a complex genome increases. For these reasons, PGR and LCR 
currently dominate the research field in detection technologies. 
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Feature 


Method 


PCR LCR PGR & 3SR Qp 
LCR NASBA 


Amplifies Target 




+ 




+ 




Recognition of Independent 
Sequences Required 


+ 


+ 




+ 




Performed at High Temp. 


+ 


+ 








Operates at Fixed Temp. 








+ 
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Exponential Amplification 


+ 


+ 


+ 


+ 
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Generic Signal Generation 










+ 


Easily Automatable 













The basis of the ampUfication procedure in the PCR and LCR is the fact that the 
5 products of one cycle become usable templates in all subsequent cycles, consequently 
doubhng the population with each cycle. The final yield of any such doubling system can 
be expressed as: (1+X)^ = y, where "X" is the mean efficiency (percent copied in each 
cycle), "n" is the number of cycles, and "y" is the overall efficiency, or yield of the 
reaction(Mullis,PCRMethods Applic, 1:1 [1991]). If every copy of a target DNA is 
10 utilized as a template in every cycle of a polymerase chain reaction, then the mean 
efficiency is 100%. If 20 cycles of PCR are performed, then the yield will be 2^^, or 
1,048,576 copies of the starting material. If the reaction conditions reduce the mean 
efficiency to 85%, then the yield in those 20 cycles will be only 1.85^^, or 220,513 copies 
of the starting material. In other words, a PCR running at 85% efficiency will yield only 
15 21% as much final product, compared to a reaction running at 100% efficiency. A 
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reaction that is reduced to 50% mean efficiency will yield less than 1% of the possible 
product. 

In practice, routine polymerase chain reactions rarely achieve the theoretical 
maximum yield, and PCRs are usually nm for more than 20 cycles to compensate for the 
5 lower yield. At 50% mean efficiency, it would take 34 cycles to achieve the million-fold 
amplification theoretically possible in 20, and at lower efficiencies, the number of cycles 
required becomes prohibitive, hi addition, ^y background products that ampUfy with a 
better mean efficiency than the intended target will become the dominant products. 
. ^ Also, many variables can influence the mean efficiency of PGR, including target 

1 0 DNA length and secondary structure, primer length and design, primer and dNTP 
concentrations, and buffer composition, to name but a few. Contamination of the 
q reaction with exogenous DNA (e.g., DNA spilled onto lab surfaces) or cross- 

y. contamination is also a major consideration. Reaction conditions must be carefully 

IS 

=fi optimized for each different primer pair and target sequence, and the process can take 
y 15 days, even for an experienced investigator. The laboriousness of this process, includmg 
numerous technical considerations and other factors, presents a significant drawback to 
O using PGR in the clinical setting. Indeed, PGR has yet to penetrate the clinical market in 
fl a significant way. The same concems arise with LGR, as LGR must also be optimized to 
j£ use different oUgonucleotide sequences for each target sequence, hi addition, both 

r| 20 methods require expensive equipment, capable of precise temperature cycling. 

Many applications of nucleic acid detection technologies, such as in studies of 
allelic variation, involve not only detection of a specific sequence in a complex 
background, but also the discrimination between sequences with few, or single, 
nucleotide differences. One method for the detection of allele-specific variants by PGR is 
25 based upon the fact that it is difficult for Tag polymerase to synthesize a DNA strand 
when there is a mismatch between the template strand and the 3' end of the primer. An 
allele-specific variant may be detected by the use of a primer that is perfectly matched 
with only one of the possible alleles; the mismatch to the other allele acts to prevent the 
extension of the primer, thereby preventing the ampUfication of that sequence. This 
30 method has a substantial limitation in that the base composition of the mismatch 

influences the ability to prevent extension across the mismatch, and certain mismatches 
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do not prevent extension or have only a minimal effect (Kwok et al, Nucl. Acids Res., 
18:999 [1990]).) 

A similar 3'-mismatch strategy is used with greater effect to prevent ligation in the 
LCR (Barany, PGR Meth. Applic, 1 :5 [1991]). Any mismatch effectively blocks the 
5 action of the thermostable ligase, but LCR still has the drawback of target-independent 
background Ugation products initiating the amphfication. Moreover, the combination of 
PGR with subsequent LGR to identify the nucleotides at individual positions is also a 
clearly cumbersome proposition for the clinical laboratory. 

' 10 n. Direct Detection Technology 

^ When a sufficient amount of a nucleic acid to be detected is available, there are 

m advantages to detecting that sequence directly, instead of making more copies of that 
target, {e.g., as in PGR and LGR). Most notably, a method that does not amphfy the 
jj| signal exponentially is more amenable to quantitative analysis. Even if the signal is 
''^ 1 5 enhanced by attaching multiple dyes to a single oligonucleotide, the correlation between 
Q the final signal intensity and amount of target is du:ect. Such a system has an additional 
p| advantage that the products of the reaction will not themselves promote further reaction, 
Ji so contamination of lab svirfaces by the products is not as much of a concern. Traditional 

Rl methods of direct detection including Northern and Southern blotting and RNase 

20 protection assays usually require the use of radioactivity and are not amenable to 
automation. Recently devised techniques have sought to eliminate the use of 
radioactivity and/or improve the sensitivity in automatable formats. Two examples are 
the "Cycling Probe Reaction" (CPR), and "Branched DNA" (bDNA) 

The cycling probe reaction (CPR) (Duck et al, BioTech., 9:142 [1990]), uses a 
25 long chimeric oligonucleotide in which a central portion is made of RNA while the two 
termini are made of DNA. Hybridization of the probe to a target DNA and exposure to a 
thermostable RNase H causes the RNA portion to be digested. This destabilizes the 
remaining DNA portions of the duplex, releasing the remainder of the probe from the 
target DNA and allowing another probe molecule to repeat the process. The signal, in the 
30 form of cleaved probe molecules, accumulates at a linear rate. While the repeating 
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process increases the signal, the RNA portion of the oUgonucleotide is vulnerable to 
RNases that may be carried through sample preparation. 

Branched DNA (bDNA), described by Urdea et al. Gene 61 :253-264 (1987), 
involves oligonucleotides with branched structures that allow each individual 

oligonucleotide to carry 35 to 40 labels {e.g., alkaline phosphatase enzymes). While this 
enhances the signal from a hybridization event, signal from non-specific binding is 
similarly increased. 

While both of these methods have the advantages of direct detection discussed 
above, neither the CPR or bDNA methods can make use of the specificity allowed by the 
requirement of independent recognition by two or more probe (oHgonucleotide) 
sequences, as is common in the signal amplification methods described in Section 1. 
above. The requirement that two oligonucleotides must hybridize to a target nucleic acid 
in order for a detectable signal to be generated confers an extra measure of stringency on 
any detection assay. Requiring two oUgonucleotides to bind to a target nucleic acid 
reduces the chance that false "positive" results will be produced due to the non-specific 
binding of a probe to the target. The fiirther requirement that the two oUgonucleotides 
must bind in a specific orientation relative to the target, as is required in PGR, where 
oUgonucleotides must be oppositely but appropriately oriented such that the DNA 
polymerase can bridge the gap between the two oligonucleotides in both directions, 
fiorther enhances specificity of the detection reaction. However, it is weU known to those 
in the art that even though PGR utilizes two oUgonucleotide probes (termed primers) 
"non-specific" ampUfication {le., ampUfication of sequences not directed by the two 
primers used) is a common artifact. This is in part because the DNA polymerase used in 
PGR can accommodate very large distances, measured in nucleotides, between the 
oUgonucleotides and thus there is a large window in which non-specific binding of an 
oUgonucleotide can lead to exponential amplification of inappropriate product. The 
LCR, in contrast, cannot proceed unless the oUgonucleotides used are bound to the target 
adjacent to each other and so the fiiU benefit of the dual oUgonucleotide hybridization is 
realized. 
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An ideal direct detection method would combine the advantages of the direct 
detection assays (e.g., easy quantification and minimal risk of carry-over contamination) 
with the specificity provided by a dual oUgonucleotide hybridization assay. 



5 III. Direct Detection of RNA 

In molecular medicine, a simple and cost-effective method for direct and 
quantitative RNA detection would greatly facilitate the analysis of RNA viruses and the 
measurement of specific gene expression. Both of these issues are currently pressing 
problems in the field. Despite this need, few techniques have emerged that are truly 
10 direct. PCR-based detection assays require conversion of RNA to DNA by reverse 

transcriptase before amphfication, introducing a variable that can compromise accurate 
quantification. Furthermore, PGR and other methods based on exponential amplification 
(e.g,, NASBA) require painstaking containment measures to avoid cross-contamination, 
;Jr and have difficulty distinguishing small differences (e.g., 2 to 3-fbld) in quantity. Other 
W 15 tests that directly examine RNA suffer from a variety of drawbacks, including time 

consuming autoradiography steps (e.g., RNase protection assays), or overnight reaction 
times (e.g., branched DNA assays). With over 1,5 miUion viral load measurements being 
performed in the U.S. every year, there is clearly an enormous potential for an 
inexpensive, rapid, high-throughput system for the quantitative measurement of RNA. 
20 Techniques for direct, quantitative detection of mRNA are vital for monitoring 

expression of a number of different genes, hi particular, levels of cytokine expression 
(e.g., interleukins and lymphokines) are being exploited as clinical measures of immune 
response in the progression of a wide variety of diseases (Van Deuren et al.,L Int. Fed. 
Clin. Chem., 5:216 [1993], Van Deuren et al, J. Inf Dis., 169:157 [1994], Perenboom et 
25 al, Eur. J. Clin, hivest, 26:159 [1996], Guidotti et al, hnmunity 4:25 [1996]) as well as 
in monitoring transplant recipients (Grant et al. Transplantation 62:910 [1996]), 
Additionally, the monitoring of viral load and identification of viral genotype have great 
clinical significance for individuals suffering viral infections by such pathogens as HIV 
or Hepatitis C virus (HCV). There is a high correlation between viral load (i.e., the 
30 absolute number of viral particles in the bloodstream) and time to progression to AIDS 
(Mellors et al. Science 272:1 167 [1996], Saag et al. Nature Medicine 2:625 [1996]). 
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For that reason, viral load, as measured by quantitative nucleic acid based testing, is 
becoming a standard monitoring procedure for evaluating the efficacy of treatment and 
the clinical status of HIV positive patients. It is thought to be essential to reduce viral 
load as early in the course of infection as possible and to evaluate viral levels on a regular 
basis. In the case of HCV, viral genotype has great clinical significance, with 
correlations to severity of liver disease and responsiveness to interferon therapy. 
Furthermore, because HCV cannot be grown in culture, it is only by establishing 
correlations between characteristics like viral genotype and clinical outcome that new 
antiviral treatments can be evaluated. 

While the above mentioned methods have been serviceable for low throughput, 
research applications, or for limited clinical application, it is clear that large scale 
quantitative analysis of RNA readily adaptable to any genetic system will require a more 
innovative approach. An ideal direct detection method would combine the advantages of 
the direct detection assays {e.g., easy quantification and minimal risk of carry-over 
contamination) with the specificity provided by a dual oligonucleotide hybridization 
assay. 

Many of the methods described above rely on hybridization alone to distinguish a 
target molecule from other nucleic acids. Although some of these methods can be highly 
sensitive, they often cannot quantitate and distinguish closely related mRNAs accurately, 
especially such RNAs expressed at different levels in the same sample. While the 
above-mentioned methods are serviceable for some purposes, a need exists for a 
technology that is particularly adept at distinguishing particular RNAs from closely 
related molecules. 

SUMMARY OF THE INVENTION 

The present invention relates to compositions and methods for the detection and 
characterization of nucleic acid sequences and variations in nucleic acid sequences. The 
present invention relates to methods for forming a nucleic acid cleavage structure on a 
target sequence and cleaving the nucleic acid cleavage structure in a site-specific manner. 
For example, in some embodiments, the 5' nuclease activity of a variety of enzymes is 
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used to cleave the target-dependent cleavage structure, thereby indicating the presence of 
specific nucleic acid sequences or specific variations thereof. 

The present invention provides structure-specific cleavage agents (e.g., nucleases) 
fi-om a variety of sources, including mesophilic, psychrophiUc, thermophilic, and 
hyperthermophilic organisms. The preferred structure-specific nucleases are 
thermostable. Thermostable structure-specific nucleases are contemplated as particularly 
usefiil in that they operate at temperatures where nucleic acid hybridization is extremely 
specific, allowing for allele-specific detection (including single-base mismatches). In one 
embodiment, the thermostable structure-specific nucleases are thermostable 5' nucleases 
comprising altered polymerases derived firom the native polymerases of Thermus 
species, including, but not limited to Thermus aquaticus, Thermus flavus, and Thermus 
thermophilus. However, the invention is not limited to the use of thermostable 5' 
nucleases. Thermostable structure-specific nucleases from the FEN-1, RAD2 and XPG 
class of nucleases are also preferred. 

The present invention provides a method for detecting a target sequence (e.g., a 
mutation, polymorphism, etc), comprising providing a sample suspected of containing the 
target sequence; oligonucleotides capable of forming an invasive cleavage structure in the 
presence of the target sequence; and an agent for detecting the presence of an invasive 
cleavage structure; and exposing the sample to the ohgonucleotides and the agent. In 
some embodiments, the method further comprises the step of detecting a complex 
comprising the agent and the invasive cleavage structure (directly or indirectly). Li some 
embodiments, the agent comprises a cleavage agent. In some preferred embodiments, the 
exposing of the sample to tiie ohgonucleotides and the agent comprises exposing the 
sample to the ohgonucleotides and the agent under conditions wherein an invasive 
cleavage structure is formed between the target sequence and the oligonucleotides if the 
target sequence is present in the sample, wherein the invasive cleavage structure is 
cleaved by the cleavage agent to form a cleavage product. In some embodiments, the 
method further comprises the step of detecting the cleavage product. In some 
embodiments, the target sequence comprises a first region and a second region, the 
second region downstream of and contiguous to the first region, and whereui the 
oligonucleotides comprise first and second ohgonucleotides, wherein at least a portion of 
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the first oligonucleotide is completely complementary to the first portion of the target 
sequence and wherein the second oUgonucleotide comprises a 3' portion and a 5' portion, 
wherein the 5' portion is completely complementary to the second portion of said target 
nucleic acid. 

The present invention also provides a kit for detecting such target sequences, said 
kit comprising oUgonucleotides capable of forming an invasive cleavage structure in the 
presence of the target sequence, hi some embodiments, the kit fiirther comprises an agent 
for detecting the presence of an invasive cleavage structure (e.g., a cleavage agent). In 
some embodiments, the ohgonucleotides comprise first and second oUgonucleotides, said 
first ohgonucleotide comprising a 5' portion complementary to a first region of the target 
nucleic acid and said second oUgonucleotide comprising a 3' portion and a 5' portion, said 
5' portion complementary to a second region of the target nucleic acid downstream of and 
contiguous to the first portion. In some preferred embodiments, the target sequence 
comprises 

The present invention also provides methods for detecting the presence of a target 
nucleic acid molecule by detecting non-target cleavage products comprising providing: a 
cleavage agent; a source of target nucleic acid, the target nucleic acid comprising a first 
region and a second region, the second region downstream of and contiguous to the first 
region; a first oligonucleotide, wherein at least a portion of the first oligonucleotide is 
completely complementary to the first portion of the target nucleic acid; and a second 
oUgonucleotide comprising a 3' portion and a 5* portion, wherein the 5* portion is 
completely complementary to the second portion of the target nucleic acid; mixing the 
cleavage agent, the target nucleic acid, the first oligonucleotide and the second 
oligonucleotide to create a reaction mixture under reaction conditions such that at least 
the portion of the first oUgonucleotide is annealed to the first region of said target nucleic 
acid and wherein at least the 5' portion of the second oUgonucleotide is annealed to the 
second region of the target nucleic acid so as to create a cleavage structure, and wherein 
cleavage of the cleavage structure occurs to generate non-target cleavage product; and 
detecting the cleavage of the cleavage structure. 

The detection of the cleavage of the cleavage structure can be carried out in any 
manner. In some embodiments, the detection of the cleavage of the cleavage structure 
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comprises detecting the non-target cleavage product. In yet other embodiments, the 
detection of the cleavage of the cleavage structure comprises detection of fluorescence, 
mass, or fluorescence energy transfer. Other detection methods include, but are not 
limited to detection of radioactivity, luminescence, phosphorescence, fluorescence 
5 polarization, and charge. In some embodiments, detection is carried out by a method 
comprising providing the non-target cleavage product; a composition comprising two 
single-stranded nucleic acids annealed so as to define a single-stranded portion of a 
protein binding region; and a protein; and exposing the non-target cleavage product to the 
single-stranded portion of the protein binding region under conditions such that the 
• 10 protein binds to the protein binding region. In some embodiments, the protein comprises 
a nucleic acid producing protein, wherein the nucleic acid producing protein binds to the 
0 protein-binding region and produces nucleic acid. In some embodiments, the protein- 
git binding region is a template-dependent RNA polymerase binding region (e.g., a T7 RNA 

polymerase binding region). In other embodiments, the detection is carried out by a 
IM 1 5 method comprising providing the non-target cleavage product; a single continuous strand 
^f^ of nucleic acid comprising a sequence defining a single strand of an RNA polymerase 
!3 binding region; a template-dependent DNA polymerase; and a template-dependent RNA 
il polymerase; exposing the non-target cleavage product to the RNA polymerase bindmg 
region under conditions such that the non-target cleavage product binds to a portion of 
^1 20 the single strand of the RNA polymerase binding region to produce a bound non-target 
cleavage product; exposing the bound non-target cleavage product to the template- 
dependent DNA polymerase under conditions such that a double-stranded RNA 
polymerase binding region is produced; and exposing the double-stranded RNA 
polymerase binding region to the template-dependent RNA polymerase under conditions 
25 such that RNA transcripts are produced. In some embodiments, the method further 

comprises the step of detecting the RNA transcripts. In some embodiments, the template- 
dependent RNA polymerase is T7 RNA polymerase. 

The present invention is not limited by the nature of the 3' portion of the second 
oligonucleotide. In some preferred embodiments, the 3' portion of the second 
30 oligonucleotide comprises a 3' terminal nucleotide not complementary to the target 
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nucleic acid. In some embodiments, the 3' portion of the second ohgonucleotide consists 
of a single nucleotide not complementary to the target nucleic acid. 

Any of the components of the method may be attached to a solid support. For 
example, in some embodiments, the first oligonucleotide is attached to a soUd support. In 
other embodiments, the second oligonucleotide is attached to a solid support. 

The cleavage agent can be any agent that is capable of cleaving invasive cleavage 
structures. In some preferred embodiments, the cleavage agent comprises a structure- 
specific nuclease. In particularly preferred embodiments, the structure-specific nuclease 
comprises a thermostable structure-specific nuclease (e.g., a thermostable 5' nuclease). 
Thermostable structure-specific nucleases include, but are not limited to, those having an 
amino acid sequence homologous to a portion of the amino acid sequence of a 
thermostable DNA polymerase derived from a thermophihc organism (e.g., Thermus 
aquaticus, Thermus flavus, and Thermus thermophilus). In other embodiments, the 
thermostable structure-specific nuclease comprises a nuclease fromthe FEN-1, RAD2 or 
XPG classes of nucleases, or chimerical structures containing one or more portions of any 
of the above cleavage agents. 

The method is not limited by the nature of the target nucleic acid. In some 
embodiments, the target nucleic acid is single stranded or double stranded DNA or RNA. 
In some embodiments, double stranded nucleic acid is rendered single stranded (e.g., by 
heat) prior to formation of the cleavage structure. In some embodiment, the source of 
target nucleic acid comprises a sample containing genomic DNA. Sample include, but 
are not limited to, blood, saliva, cerebral spinal fluid, pleural fluid, milk, lymph, sputum 
and semen. 

In some embodiments, the reaction conditions for the method comprise providing 
a source of divalent cations. In some preferred embodiments, the divalent cation is 
selected from the group comprising Mn2+ and Mg2+ ions. In some embodiments, the 
reaction conditions for the method comprise providing the first and the second 
oUgonucleotides in concentration excess compared to the target nucleic acid. 

In some embodiments, the method further comprises providing a third 
ohgonucleotide complementary to a third portion of said target nucleic acid upstream of 
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the first portion of the target nucleic acid, wherein the third oUgonucleotide is mixed with 
the reaction mixture. 

The present invention also provides a method for detecting the presence of a 
target nucleic acid molecule by detecting non-target cleavage products comprising 
5 providing: a cleavage agent; a source of target nucleic acid, the target nucleic acid 
comprising a first region and a second region, the second region downstream of and 
contiguous to the first region; a plurality of first oligonucleotides, wherein at least a 
portion of the furst ohgonucleotides is completely complementary to the first portion of 
the target nucleic acid; a second oUgonucleotide comprising a 3' portion and a 5' portion, 

^ 10 wherein said 5* portion is completely complementary to the second portion of the target 
nucleic acid; mixing the cleavage agent, the target nucleic acid, the plurahty of first 
ohgonucleotides and second oligonucleotide to create a reaction mixture under reaction 

O conditions such that at least the portion of a first oUgonucleotide is annealed to the first 

2 region of the target nucleic acid and wherein at least the 5' portion of the second 

^3 1 5 oligonucleotide is annealed to the second region of the target nucleic acid so as to create a 
'0 cleavage structure, and wherein cleavage of the cleavage structure occurs to generate 
ip non-target cleavage product, wherein the conditions permit multiple cleavage structures 

3 to form and be cleaved fi-om the target nucleic acid; and detecting the cleavage of said 
M cleavage structures. In some embodiments, the conditions comprise isothermal 

S 20 conditions that permit the plurahty of first oUgonucleotides to dissociate firom the target 
nucleic acid. While the present invention is limited by the number of cleavage structure 
formed on a particular target nucleic acid, in some preferred embodiments, two or more 
(3, 4, 5, . . 10, . . 10000, . . .) of the plurality of first oligonucleotides form cleavage 
structures with a particular target nucleic acid, wherein the cleavage structures are 

25 cleaved to produce the non-target cleavage products. 

The present invention also provides methods wherein a cleavage product firom the 
above methods is used in a fiirther invasive cleavage reaction. For example, the present 
invention provides a method comprismg providing a cleavage agent; a first target nucleic 
acid, the first target nucleic acid comprising a fu^t region and a second region, the second 

30 region downstream of and contiguous to the first region; a first oligonucleotide, wherein 
at least a portion of the fust oUgonucleotide is completely complementary to the first 
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portion of the first target nucleic acid; a second oligonucleotide comprising a 3' portion 
and a 5' portion, wherein the 5' portion is completely complementary to the second 
portion of the first target nucleic acid; a second target nucleic acid, said second target 
nucleic acid comprising a first region and a second region, the second region downstream 
5 of and contiguous to the first region; and a third oHgonucleotide, wherein at least a 
portion of the third oUgonucleotide is completely complementary to the first portion of 
the second target nucleic acid; generating a first cleavage structure wherein at least said 
portion of the first oUgonucleotide is annealed to the first region of the first target nucleic 
acid and wherein at least the 5' portion of the second oligonucleotide is annealed to the 
10 second region of the first target nucleic acid and wherein cleavage of the first cleavage 
structure occurs via the cleavage agent thereby cleaving the first oUgonucleotide to 
0 generate a fourth oligonucleotide, said fourth oUgonucleotide comprising a 3' portion and 
m a 5* portion, wherein the 5' portion is completely complementary to the second portion of 
J the second target nucleic acid; generating a second cleavage structure under conditions 
W 1 5 wherein at least said portion of the third oligonucleotide is annealed to the first region of 
,f the second target nucleic acid and wherein at least the 5* portion of the fourth 
y oUgonucleotide is annealed to the second region of the second target nucleic acid and 
n wherein cleavage of the second cleavage structure occurs to generate a cleavage 

q fi-agment; and detecting the cleavage of the second cleavage structure. In some preferred 

20 embodiments, the 3' portion of the fourth oligonucleotide comprises a 3' terminal 

nucleotide not complementary to the second target nucleic acid. In some embodiments, 
the 3' portion of the third oUgonucleotide is covalently linked to the second target nucleic 
acid. In some embodiments, the second target nucleic acid further comprises a 5' region, 
wherein the 5' region of the second target nucleic acid is the third oligonucleotide. 
25 The present invention further provides kits comprising: a cleavage agent; a first 
oligonucleotide comprising a 5' portion complementary to a first region of a target 
nucleic acid; and a second oUgonucleotide comprising a 3* portion and a 5' portion, said 5' 
portion complementary to a second region of the target nucleic acid downstream of and 
contiguous to the first portion. In some embodiments, the 3* portion of the second 
30 oUgonucleotide comprises a 3' terminal nucleotide not complementary to the target 
nucleic acid. In preferred embodiments, the 3' portion of the second oligonucleotide 
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consists of a single nucleotide not complementary to the target nucleic acid. In some 
embodiments, the kit further comprises a solid support. For example, in some 
embodiments, the first and/or second oligonucleotide is attached to said solid support, hi 
some embodiments, the kit further comprises a buffer solution. In some preferred 
5 embodiments, the buffer solution comprises a source of divalent cations (e.g., Mn2+ 
and/or Mg^"*" ions). In some specific embodiments, the kit further comprises a third 
oUgonucleotide complementary to a third portion of the target nucleic acid upstream of 
the first portion of the first target nucleic acid. In yet other embodiments, the kit further 
comprises a target nucleic acid. In some embodiments, the kit further comprises a second 
10 target nucleic acid. In yet other embodiments, the kit further comprises a third 
U oligonucleotide comprising a 5' portion complementary to a first region of the second 
y target nucleic acid. In some specific embodiments, the 3' portion of the third 
SI oligonucleotide is covalently linked to the second target nucleic acid. In other specific 
Si embodiments, the second target nucleic acid further comprises a 5* portion, wherein the 5' 

15 portion of the second target nucleic acid is the third oUgonucleotide. In still other 
n embodiments, the kit further comprises an ARRESTOR molecule (e.g., ARRESTOR 

[if oligonucleotide). 

[f The present invention further provides a composition comprising a cleavage 

0 structure, the cleavage structure comprising: a) a target nucleic acid, the target nucleic 
' ^ 20 acid having a first region, a second region, a third region and a fourth region, wherein the 
first region is located adjacent to and downstream from the second region, the second 
region is located adjacent to and downstream firom the third region and the third region is 
located adjacent to and downstream from the fourth region; b) a first oligonucleotide 
complementary to the fourth region of the target nucleic acid; c) a second oligonucleotide 
25 having a 5' portion and a 3' portion wherein the 5* portion of the second oUgonucleotide 
contains a sequence complementary to the second region of the target nucleic acid and 
wherein the 3' portion of the second oligonucleotide contains a sequence complementary 
to the third region of the target nucleic acid; and d) a third oligonucleotide having a 5' 
portion and a 3' portion wherein the 5' portion of the third oligonucleotide contains a 
30 sequence complementary to the first region of the target nucleic acid and wherein the 3' 
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portion of the third oUgonucleotide contains a sequence complementary to the second 
region of the target nucleic acid. 

The present mvention is not limited by the length of the four regions of the target 
nucleic acid. In one embodiment, the first region of the target nucleic acid has a length of 
5 1 1 to 50 nucleotides. In another embodiment, the second region of the target nucleic acid 
has a length of one to three nucleotides. In another embodiment, the third region of the 
target nucleic acid has a length of six to nine nucleotides. In yet another embodiment, the 
fourth region of the target nucleic acid has a length of 6 to 50 nucleotides. 

The invention is not Umited by the nature or composition of the of the first, 
10 second, third and fourth oUgonucleotides; these oligonucleotides may comprise DNA, 
RNA, PNA and combinations thereof as well as comprise modified nucleotides, universal 
Q bases, adducts, etc. Further, one or more of the first, second, third and the fourth 
S oligonucleotides may contain a dideoxynucleotide at the 3' terminus. 
^ In one preferred embodiment, the target rmcleic acid is not completely 

15 complementary to at least one of the first, the second, the third and the fourth 
1^ oligonucleotides. In a particularly preferred embodiment, the target nucleic acid is not 
p completely complementary to the second oligonucleotide. 

As noted above, the present invention contemplates the use of structure-specific 
; J nucleases in detection methods. In one embodiment, the present invention provides a 
f| 20 method of detecting the presence of a target nucleic acid molecule by detecting non- 
target cleavage products comprising: a) providing: i) a cleavage means, ii) a source of 
target nucleic acid, the target nucleic acid having a first region, a second region, a third 
region and a fourth region, wherein the first region is located adjacent to and downstream 
fi-om the second region, the second region is located adjacent to and downstream from the 
25 third region and the third region is located adj acent to and downstream fi^om the fourth 
region; iii) a first oligonucleotide complementary to the fourth region of the target nucleic 
acid; iv) a second oUgonucleotide having a 5* portion and a 3' portion wherein the 5' 
portion of the second oUgonucleotide contains a sequence complementary to the second 
region of the target nucleic acid and wherein the 3* portion of the second oUgonucleotide 
30 contains a sequence complementary to the third region of the target nucleic acid; iv) a 
third oligonucleotide having a 5' and a 3' portion wherein the 5' portion of the third 
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oligonucleotide contains a sequence complementary to the first region of the target 
nucleic acid and wherein the 3' portion of the third oUgonucleotide contains a sequence 
complementary to the second region of the target nucleic acid; b) mixing the cleavage 
means, the target nucleic acid, the first oUgonucleotide, the second oligonucleotide and 
the third oligonucleotide to create a reaction mixture under reaction conditions such that 
the first oligonucleotide is annealed to the fourth region of the target nucleic acid and 
wherein at least the 3' portion of the second oligonucleotide is annealed to the target 
nucleic acid and wherein at least the 5' portion of the third oUgonucleotide is annealed to 
the target nucleic acid so as to create a cleavage structure and wherein cleavage of the 
cleavage structure occurs to generate non-target cleavage products, each non-target 
cleavage product having a 3'-hydroxyl group; and c) detecting the non-target cleavage 
products. 

The invention is not limited by the nature of the target nucleic acid. In one 
embodiment, the target nucleic acid comprises single-stranded DNA. hi another 
embodiment, the target nucleic acid comprises double-stranded DNA and prior to step c), 
the reaction mixture is treated such that the double-stranded DNA is rendered 
substantially single-stranded. In another embodiment, the target nucleic acid comprises 
RNA and the first and second oligonucleotides comprise DNA. 

The invention is not limited by the nature of the cleavage means, hi one 
embodiment, the cleavage means is a structure-specific nuclease; particularly preferred 
structure-specific nucleases are thermostable structure-specific nucleases. 

In another preferred embodiment the thermostable structure specific nuclease is a 
chimerical nuclease. 

In an alternative preferred embodiment, the detection of the non-target cleavage 
products comprises electrophoretic separation of the products of the reaction followed by 
visuaUzation of the separated non-target cleavage products. 

In another preferred embodiment, one or more of the first, second, and third 
oUgonucleotides contain a dideoxynucleotide at the 3' terminus. When 
dideoxynucleotide-containing oUgonucleotides are employed, the detection of the non- 
target cleavage products preferably comprises: a) incubating the non-target cleavage 
products with a template-independent polymerase and at least one labeled nucleoside 
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triphosphate under conditions such that at least one labeled nucleotide is added to the 3- 
hydroxyl group of the non-target cleavage products to generate labeled non-target 
cleavage products; and b) detecting the presence of the labeled non-target cleavage 
products. The invention is not limited by the nature of the template-independent 
5 polymerase employed; in one embodiment, the template-independent polymerase is 
selected from the group consisting of terminal deoxynucleotidyl transferase (TdT) and 
poly A polymerase. When TdT or polyA polymerase are employed in the detection step, 
the second oUgonucleotide may contain a 5' end label, the 5' end label being a different 
label tiian the label present upon the labeled nucleoside triphosphate. The invention is 
• 10 not limited by the nature of the 5' end label; a wide variety of suitable 5' end labels are 

known to the art and include biotin, fluorescein, tetrachlorofluorescein, 
Q hexachlorofluorescein, Cy3 amidite, Cy5 amidite and digoxigenin. 
jj In another embodiment, detecting the non-target cleavage products comprises: a) 

J incubating the non-target cleavage products with a template-independent polymerase and 
W 15 at least one nucleoside triphosphate under conditions such that at least one nucleotide is 
r added to the 3'-hydroxyl group of the non-target cleavage products to generate tailed non- 
!i target cleavage products; and b) detecting the presence of the tailed non-target cleavage 
^1 products. The invention is not limited by the nature of the template-independent 
S polymerase employed; m one embodiment, the template-independent polymerase is 
''^ 20 selected from the group consisting of terminal deoxynucleotidyl transferase (TdT) and 

poly A polymerase. When TdT or polyA polymerases are employed in the detection step, 
the second oligonucleotide may contain a 5' end label. The invention is not Umited by the 
nature of the 5' end label; a wide variety of suitable 5' end labels are known to the art and 
include biotin, fluorescein, tetrachlorofluorescein, hexachlorofluorescein, REDMOND 
25 RED dye, Cy3 amidite, Cy5 amidite and digoxigenin. 

In a preferred embodiment, the reaction conditions comprise providing a source of 
divalent cations; particularly preferred divalent cations are Mn2+ and Mg2+ ions. 

The present invention fijrther provides a method of detecting the presence of a 
target nucleic acid molecule by detecting non-target cleavage products comprising: a) 
30 providing: i) a cleavage means, ii) a source of target nucleic acid, the target nucleic acid 
having a first region, a second region and a third region, wherein the first region is 
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located adjacent to and downstream from the second region and wherein the second 
region is located adjacent to and downstream from the third region; iii) a first 
oligonucleotide having a 5' and a 3' portion wherein the 5' portion of the first 
oligonucleotide contains a sequence complementary to the second region of the target 
nucleic acid and wherein the 3' portion of the first oligonucleotide contains a sequence 
complementary to the third region of the target nucleic acid; iv) a second oligonucleotide 
having a length between eleven to fifteen nucleotides and further having a 5' and a 3' 
portion wh^ein the 5' portion of the second oligonucleotide contains a sequence 
complementary to tiie first region of the target nucleic acid and wherein the 3' portion of 
the second oUgonucleotide contains a sequence complementary to the second region of 
the target nucleic acid; b) mixing the cleavage means, the target nucleic acid, the first 
oligonucleotide and the second oUgonucleotide to create a reaction mixture under 
reaction conditions such that at least tiie 3' portion of the first oUgonucleotide is annealed 
to the target nucleic acid and wherein at least the 5' portion of tiie second oUgonucleotide 
is aimealed to the target nucleic acid so as to create a cleavage structure and wherein 
cleavage of the cleavage structure occurs to generate non-target cleavage products, each 
non-target cleavage product having a 3'-hydroxyl group; and c) detecting the non-target 
cleavage products. In a preferred embodiment the cleavage means is a sti:ucture-specific 
nuclease, preferably a thermostable structure-specific nuclease. 

The invention is not limited by the length of the various regions of the target 
nucleic acid. In a preferred embodiment, tiie second region of tiie target nucleic acid has 
a length between one to five nucleotides. In another preferred embodiment, one or more 
of the first and tiie second oUgonucleotides contain a dideoxynucleotide at the 3' 
terminus. When dideoxynucleotide-containing oUgonucleotides are employed, tiie 
detection of the non-target cleavage products preferably comprises: a) incubating the 
non-target cleavage products witii a template-independent polymerase and at least one 
labeled nucleoside triphosphate under conditions such that at least one labeled nucleotide 
is added to the 3'-hydroxyl group of tiie non-target cleavage products to generate labeled 
non-target cleavage products; and b) detectmg tiie presence of the labeled non-target 
cleavage products. The mvention is not limited by tiie nature of the template-independent 
polymerase employed; in one embodunent, tiie template-independent polymerase is 
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selected from the group consisting of terminal deoxynucleotidyl transferase (TdT) and 
poly A polymerase. When TdT or polyA polymerase is employed in the detection step, 
the second oligonucleotide may contain a 5' end label, the 5' end label being a different 
label than the label present upon the labeled nucleoside triphosphate. The invention is 

5 not limited by the nature of the 5* end label; a wide variety of suitable 5* end labels are 
known to the art and include biotin, fluorescein, tetrachlorofluorescein, 
hexachlorofluorescein, REDMOND RED dye, Cy3 amidite, Cy5 amidite and 
digoxigenin. 

In another embodiment, detecting the non-target cleavage products comprises: a) 
1 0 incubating the non-target cleavage products with a template-independent polymerase and 
at least one nucleoside triphosphate under conditions such that at least one nucleotide is 
ip added to the 3'-hydroxyl group of the non-target cleavage products to generate tailed non- 
y target cleavage products; and b) detecting the presence of the tailed non-target cleavage 
"$ products. The invention is not limited by the nature of the template-independent 
bS 15 polymerase employed; in one embodiment, the template-independent polymerase is 

selected from the group consisting of terminal deoxynucleotidyl transferase (TdT) and 
C3 poly A polymerase. When TdT or polyA polymerases are employed in the detection step, 
jl the second oligonucleotide may contain a 5' end label The invention is not limited by the 
W nature of the 5' end label; a wide variety of suitable 5' end labels are known to the art and 

Rl 20 include biotin, fluorescein, tetrachlorofluorescein, hexachlorofluorescein, REDMOND 
RED dye, Cy3 amidite, Cy5 amidite and digoxigenin. 

The novel detection methods of the invention may be employed for the detection 
of target DNAs and RNAs including, but not limited to, t^get DNAs and RNAs 
comprising wild type and mutant alleles of genes, including genes from humans or other 
25 animals that are or may be associated with disease or cancer. In addition, the methods of 
the invention maybe used for the detection of and/or identification of strains of 
microorganisms, including bacteria, fungi, protozoa, cihates and viruses (and in particular 
for the detection and identification of RNA viruses, such as HCV). 

The present invention further provides novel enzymes designed for direct 
30 detection, characterization and quantitation of nucleic acids, particularly RNA. The 
present invention provides enzymes that recognize specific nucleic acid cleavage 

22 



structures formed on a target RNA sequence and that cleave the nucleic acid cleavage 
structure in a site-specific manner to produce non-target cleavage products. The present 
invention provides enzymes having an improved ability to specifically cleave a DNA 
member of a complex comprising DNA and RNA nucleic acid strands. 
5 For example, the present invention provides DNA polymerases that are altered in 

structure relative to the native DNA polymerases, such that they exhibit altered {e,g., 
improved) performance in detection assays based on the cleavage of a structure 
comprising nucleic acid (e.g., RNA). In particular, the altered polymerases of the present 
invention exhibit improved performance in detection assays based on the cleavage of a 
1 0 DNA member of a cleavage structure (e.g. , an invasive cleavage structure) that comprises 
an RNA target strand. 

p The improved performance in a detection assay may arise from any one of, or a 

combination of several improved features. For example, in one embodiment, the enzyme 
£ of the present invention may have an improved rate of cleavage (kcat) on a specific 
5 1 5 targeted structure, such that a larger amount of a cleavage product may be produced in a 
^ given time span. In another embodiment, the enzyme of the present invention may have 
O a reduced activity or rate in the cleavage of inappropriate or non-specific structures. For 
f?| example, in certain embodiments of the present invention, one aspect of improvement is 

that the differential between the detectable amount of cleavage of a specific structure and 
fl 20 the detectable amount of cleavage of any ahemative structures is increased. As such, it is 
within the scope of the present invention to provide an enzyme having a reduced rate of 
cleavage of a specific target structure compared to the rate of the native enzyme, and 
having a fiirther reduced rate of cleavage of any alternative structures, such that the 
differential between the detectable amount of cleavage of the specific structure and the 
25 detectable amount of cleavage of any ahemative structures is increased. However, the 
present invention is not limited to enzymes that have an improved differential. 

In a preferred embodiment, the enzyme of the present invention is a DNA 
polymerase having an altered nuclease activity as described above, and also having 
altered synthetic activity, compared to that of any native DNA polymerase fi:om which 
30 the enzyme has been derived. It is especially preferred that the DNA polymerase is 
altered such that it exhibits reduced synthetic activity as well as improved nuclease 

23 



activity on RNA targets, compared to that of the native DNA polymerase. Enzymes and 
genes encoding enzymes having reduced synthetic activity have been described (See e.g., 
Kaiser et al, J. Biol. Chem., 274:21387 [1999], Lyamichev et al, Prot Natl. Acad. Sci., 
96:6143 [1999], US. Patents 5,541,311, 5,614,402, 5,795,763 and 6,090,606, 
incorporated herein by reference in their entireties). The present invention contemplates 
combined modifications, such that the resulting 5' nucleases are without interfering 
synthetic activity, and have improved performance in RNA detection assays. 

The present invention contemplates a DNA sequence encoding a DNA 
polymerase altered in sequence relative to the native sequence, such that it exhibits 
altered nuclease activity from that of the native DNA polymerase. For example, in one 
embodiment, the DNA sequence encodes an enzyme having an improved rate of cleavage 
(kcat) on a specific targeted structure, such that a larger amount of a cleavage product may 
be produced in a given time span. Li another embodiment, the DNA encodes an enzyme 
having a reduced activity or rate in the cleavage of inappropriate or non-specific 
structures. In certain embodiments, one aspect of improvement is that the differential 
between the detectable amount of cleavage of a specific structure and the detectable 
amount of cleavage of any altemative structures is increased. It is within the scope of the 
present invention to provide a DNA encoding an enzyme having a reduced rate of 
cleavage of a specific target structure compared to the rate of the native enzyme, and 
having a fiirther reduced rate of cleavage of any altemative structures, such that the 
differential between the detectable amount of cleavage of the specific structure ^d the 
detectable amount of cleavage of any altemative structures is increased. However, the 
present invention is not limited to polymerases that have an improved differential. 

In a preferred embodiment, the DNA sequence encodes a DNA polymerase 
having the altered nuclease activity described above, and also having altered synthetic 
activity, compared to that of any native DNA polymerase from which the improved 
enzyme is derived. It is especially preferred that the encoded DNA polymerase is altered 
such that it exhibits reduced synthetic activity as well as improved nuclease activity on 
RNA targets, compared to that of the native DNA polymerase. 

It is not intended that the invention be limited by the nature of the alteration 
required to introduce altered nuclease activity. Nor is it intended that the invention be 
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limited by the extent of either the alteration, or in the improvement observed. If the 
polymerase is also altered so as to be synthesis modified, it is not intended that the 
invention be limited by the polymerase activity of the modified or unmodified protein, or 
by the nature of the alteration to render the polymerase synthesis modified. 

The present invention contemplates structure-specific nucleases from a variety of 
sources, including, but not limited to, mesophilic, psychrophilic, thermophihc, and 
hyperthermophihc organisms. The preferred structure-specific nucleases are 
thermostable. Thermostable structure-specific nucleases are contemplated as particularly 
usefiil in that they allow the INVADER assay {See e.g., U.S. Pat. Nos. 5,846,717, 
5,985,557, 5,994,069, 6,001,567, and 6,090,543 and PCT Publications WO 97/27214 and 
WO 98/42873, incorporated herein by reference in their entireties) to be operated near the 
melting temperature (Tm) of the downstream probe oligonucleotide, so that cleaved and 
uncleaved probes may cycle on and off the target during the course of the reaction. In 
one embodiment, the thermostable structure-specific enzymes are thermostable 5' 
nucleases that are selected firom the group comprising altered polymerases derived fi*om 
the native polymerases of Thermus species, including, but not limited to, Thermus 
aquaticus, Thermus flavus, Thermus thermophilus, Thermus filiformus, and Thermus 
scotoductus. However, the invention is not limited to the use of thermostable 5' 
nucleases. For example, certain embodiments of the present invention utilize short 
oUgonucleotide probes that may cycle on and off of the target at low temperatures, 
allowing the use of non-thermostable enzymes. 

In some preferred embodiments, the present invention provides a composition 
comprising an enzyme, wherein the enzyme comprises a heterologous fimctional domain, 
wherein the heterologous functional domain provides altered (e.g., improved) 
functionality in a nucleic acid cleavage assay. The present invention is not limited by the 
nature of the nucleic acid cleavage assay. For example, nucleic acid cleavage assays 
include any assay in which a nucleic acid is cleaved, directly or indirectly, in the presence 
of the enzyme. In certain preferred embodiments, the nucleic acid cleavage assay is an 
invasive cleavage assay. In particularly preferred embodiments, the cleavage assay 
utiHzes a cleavage structure having at least one RNA component. In another particularly 
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preferred embodiment, the cleavage assay utilizes a cleavage structxire having at least one 
RNA component, wherein a DNA member of the cleavage structure is cleaved. 

In some preferred embodiments, the enzyme comprises a 5' nuclease or a 
polymerase. In certain preferred embodiments, the 5' nuclease comprises a thermostable 
5 5' nuclease. In other preferred embodiments, the polymerase is altered in sequence 
relative to a naturally occurring sequence of a polymerase such that it exhibits reduced 
DNA synthetic activity from that of the naturally occurring polymerase. In certain 
preferred embodiments, the polymerase comprises a thermostable polymerase {e.g. , a 
polymerase from a Thermus species including, but not limited to, Thermus aquaticns, 
10 Thermus flavus, Thermus thermophilus, Thermus filiformus, and Thermus scotoductus). 
The present invention is not limited by the nature of the altered ftmctionality 
C3 provided by the heterologous functional domain. Illustrative examples of alterations 

n. 

m include, but are not limited to, enzymes where the heterologous ftinctional domam 
S comprises an amino acid sequence {e.g.^ one or more amino acids) that provides an 
y 15 improved nuclease activity, an unproved substrate binding activity and/or improved 
^r*' background specificity in a nucleic acid cleavage assay. 

O The present invention is not limited by the nature of the heterologous ftmctional 

Hi domain. For example, in some embodiments, the heterologous fimctional domain 
% comprises two or more amino acids from a polymerase domain of a polymerase {e.g. , 
il 20 introduced into the enzyme by insertion of a chimeric functional domain or created by 
mutation). In certain preferred embodiment, at least one of the two or more amino acids 
is from a pahn or thumb region of the polymerase domain. The present invention is not 
limited by the identity of the polymerase from which the two or more amino acids are 
selected. In certain preferred embodiments, the polymerase comprises Thermus 
25 thermophilus polymerase. In particularly preferred embodiments, the two or more amino 
acids are from amino acids 300-650 of SEQ E) N0:1. 

The novel enzymes of the invention may be employed for the detection of target 
DNAs and RNAs including, but not limited to, target DNAs and RNAs comprising wild 
type and mutant alleles of genes, including, but not limited to, genes from humans, other 
30 animal, or plants that are or may be associated with disease or other conditions. In 
addition, the enzymes of the invention may be used for the detection of and/or 
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identification of strains of microorganisms, including bacteria, fimgi, protozoa, ciliates 
and viruses (and in particular for the detection and identification of viruses having RNA 
genomes, such as the Hepatitis C and Human Immimodeficiency viruses). For example, 
the present invention provides methods for cleaving a nucleic acid comprising providing: 
5 an enzyme of the present invention and a substrate nucleic acid; and exposing the 
substrate nucleic acid to the enzyme (e.g., to produce a cleavage product that may be 
detected). 

In one embodiment, the present invention provides a thermostable 5' nuclease 
s,., having an amino acid sequence selected fi-om the group comprising SEQ ID N0S:2, 3, 4, 
10 5, 6, 7, 8, 9, 10, 1 1 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 
31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 

0 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 341, 346, 348, 351, 353, 359, 365, 
5 367, 369, 374, 376, 380, 384, 388, 392, 396, 400, 402, 406, 408, 410, 412, 416, 418, 420, 

1 424, 427, 429, 432, 436, 440, 444, 446, 448, 450, 456, 460, 464, 468, 472, 476, 482, 485, 
IM 15 488, 491, 494, 496, 498, 500, 502, 506, 510, 514, 518, 522, 526, 530, 534, 538, 542, 544, 
f 550, 553, 560, 564, 566, 568, 572, 574, 576, 578, 580, 582, 584, 586, 588, 590, 2641- 

Q 2674, 2710, 2712, 2714, 2717, 2719, 2722, 2724, 2726, 2728, 2730, 2732, 2734, 2736, 

11 2738, 2740, 2742, 2744, 2746, 2748, 2750, 2752, 2754, 2756, 2758, 2760, 2762, 2766, 

2768, 2770, 2772, 2774, 2776, 2778, 2780, 2781, 2782, 2783, 2785, 2787, 2789, 2791, 
t 20 2793, 2795, 2797, 2799, 2801, 2803, 2805, 2807, 2809, 2811, 2813, 2815, 2817, 2819, 
2821, 2823, 2825, 2827, 2829, 2831, 2833, 2835, and 2837. In another embodiment, the 
5' nuclease is encoded by a DNA sequence selected from the group comprising of SEQ 
ID NOS:69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 
90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 
25 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 
128, 129, 130, 131, 132, 133, 134, 135, 340, 345, 347, 350, 352, 358, 364, 366, 368, 373, 
375, 379, 383, 387, 391, 395, 399, 401, 405, 407, 409, 411, 415, 417, 419, 423, 426, 428, 
431, 435, 439, 443, 445, 447, 449, 452, 454, 455, 459, 463, 467, 471, 475, 481, 484, 495, 
497, 499, 501, 505, 509, 513, 517, 521, 525, 529, 533, 537, 541, 543, 549, 552, 559, 563, 
30 565, 567, 571, 573, 575, 577, 579, 581, 583, 585, 587, 589, 2675-2709, 2711, 2713, 
2715, 2716, 2718, 2720, 2721, 2723, 2725, 2727, 2729, 2731, 2733, 2735, 2737, 2739, 
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2741, 2743 2745, 2747, 2749, 2751, 2753, 2755, 2757, 2759, 2761, 2763, 2764, 2765, 
2767, 2769, 2771, 2773, 2775, 2777, 2779, 2784, 2786, 2788, 2790, 2792, 2794, 2796, 
2798, 2800, 2802, 2804, 2806, 2808, 2810, 2812, 2814, 2816, 2818, 2820, 2822, 2824, 
2826, 2828, 2830, 2832, 2834, and 2836. 
5 The present invention also provides a recombinant DNA vector comprising DNA 

having a nucleotide sequence encoding a 5' nuclease, the nucleotide sequence selected 
from the group comprising SEQ ID NOS: 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 
81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 
103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 
10 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 340, 345, 347, 
350, 352, 358, 364, 366, 368, 373, 375, 379, 383, 387, 391, 395, 399, 401, 405, 407, 409, 
ill 411, 415, 417, 419, 423, 426, 428, 431, 435, 439, 443, 445, 447, 449, 452, 454, 455, 459, 

|j 463, 467, 471, 475, 481, 484, 495, 497, 499, 501, 505, 509, 513, 517, 521, 525, 529, 533, 

I 537, 541, 543, 549, 552, 559, 563, 565, 567, 571, 573, 575, 577, 579, 581, 583, 585, 587, 

y 15 589, 2675-2709, 2711, 2713, 2715, 2716, 2718, 2720, 2721, 2723, 2725, 2727, 2729, 
f 2731, 2733, 2735, 2737, 2739, 2741, 2743 2745, 2747, 2749, 2751, 2753, 2755, 2757, 

jg 2759, 2761, 2763, 2764, 2765, 2767, 2769, 2771, 2773, 2775, 2777, 2779, 2784, 2786, 

j| 2788, 2790, 2792, 2794, 2796, 2798, 2800, 2802, 2804, 2806, 2808, 2810, 2812, 2814, 

I 2816, 2818, 2820, 2822, 2824, 2826, 2828, 2830, 2832, 2834, and 2836. In a preferred 

f| 20 embodiment, the invention provides a host cell transformed with a recombinant DNA 
vector comprising DNA having a nucleotide sequence encoding a structure-specific 
nuclease, the nucleotide selected from the group comprising sequence SEQ ID NOS: 69, 
70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 
94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 
25 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 
131, 132, 133, 134, 135, 340, 345, 347, 350, 352, 358, 364, 366, 368, 373, 375, 379, 383, 
387, 391, 395, 399, 401, 405, 407, 409, 411, 415, 417, 419, 423, 426, 428, 431, 435, 439, 
443, 445, 447, 449, 452, 454, 455, 459, 463, 467, 471, 475, 481, 484, 495, 497, 499, 501, 
505, 509, 513, 517, 521, 525, 529, 533, 537, 541, 543, 549, 552, 559, 563, 565, 567, 571, 
30 573, 575, 577, 579, 581, 583, 585, 587, 589, 2675-2709, 2711, 2713, 2715, 2716, 2718, 
2720, 2721, 2723, 2725, 2727, 2729, 2731, 2733, 2735, 2737, 2739, 2741, 2743 2745, 
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2747, 2749, 2751, 2753, 2755, 2757, 2759, 2761, 2763, 2764, 2765, 2767, 2769, 2771, 
2773, 2775, 2777, 2779, 2784, 2786, 2788, 2790, 2792, 2794, 2796, 2798, 2800, 2802, 
2804, 2806, 2808, 2810, 2812, 2814, 2816, 2818, 2820, 2822, 2824, 2826, 2828, 2830, 
2832, 2834, and 2836. The invention is not limited by the nature of the host cell 
employed. The art is well aware of expression vectors suitable for the expression of 
nucleotide sequences encoding structure-specific nucleases that can be expressed in a 
variety of prokaryotic and eukaryotic host cells. In a preferred embodiment, the host cell 
is an Escherichia coli cell. 

The present invention provides a method of altering 5* nuclease enzymes relative 
to native 5' nuclease enzymes, such that they exhibit improved performance in detection 
assays based on the cleavage of a structure comprising RNA. In particular, the altered 5' 
nucleases produced by the method of the present invention exhibit improved performance 
in detection assays based on the cleavage of a DNA member of a cleavage structure {e,g,, 
an invasive cleavage structure) that comprises an RNA target strand. The improved 5' 
nucleases resulting from the methods of the present invention may be improved in any of 
the ways discussed herein. Examples of processes for assessing improvement in any 
candidate enzyme are provided. 

For example, the present invention provides methods for producing an altered 
enzyme with improved functionality in a nucleic acid cleavage assay comprising: 
providing an enzyme and a nucleic acid test substrate; introducing a heterologous 
functional domain into the enzyme to produce an altered enzyme; contacting the altered 
enzyme with the nucleic acid test substrate to produce cleavage products; and detecting 
the cleavage products. In some embodiments, the introduction of the heterologous 
functional domain comprises mutating one or more amino acids of the enzyme. In other 
embodiments, the introduction of the heterologous fiinctional domain into the enzyme 
comprises adding a functional domain from a protein {e.g., another enzyme) into the 
enzyme {e.g., substituting functional domains by removing a portion of the enzyme 
sequence prior to adding the functional domain of the protein). In preferred 
embodiments, the nucleic acid test substrate comprises a cleavage structure. In 
particularly preferred embodiment, the cleavage structure comprises an RNA target 
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nucleic acid. In yet other preferred embodiments, the cleavage structure comprises an 
invasive cleavage structure. 

The present invention also provides nucleic acid treatment kits. One preferred 
embodiment is a kit comprising a composition comprising at least one improved 5' 
5 nuclease. Another preferred embodiment provides a kit comprising: a) a composition 
comprising at least one improved 5' nuclease; and b) an INVADER oHgonucleotide and a 
signal probe oligonucleotide. In some embodiments of the kits of the present invention, 
the improved 5' nuclease is derived from a DNA polymerase from a eubacterial species. 
In further embodiments, the eubacterial species is a thermophile. In still further 
10 embodiments, the thermophile is of the genus r/ier/wM5'. In still further embodiments, the 
thermophile is selected from the group consisting of Thermus aquaticus, Thermus flavus, 
y Thermus thermophihis, Thermus filiformus, and Thermus scotoductus. In preferred 

Bl embodiments, the improved 5' nuclease is encoded by DNA selected from the group 
j| comprising SEQ ID NOS: 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 
W 15 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 
r 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 

125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 340, 345, 347, 350, 352, 358, 364, 
[| 366, 368, 373, 375, 379, 383, 387, 391, 395, 399, 401, 405, 407, 409, 411, 415, 417, 419, 

5 423, 426, 428, 431, 435, 439, 443, 445, 447, 449, 452, 454, 455, 459, 463, 467, 471, 475, 
20 481, 484, 495, 497, 499, 501, 505, 509, 513, 517, 521, 525, 529, 533, 537, 541, 543, 549, 
552, 559, 563, 565, 567, 571, 573, 575, 577, 579, 581, 583, 585, 587, 589, 2675-2709, 
271 1, 2713, 2715, 2716, 2718, 2720, 2721, 2723, 2725, 2727, 2729, 2731, 2733, 2735, 
2737, 2739, 2741, 2743 2745, 2747, 2749, 2751, 2753, 2755, 2757, 2759, 2761, 2763, 
2764, 2765, 2767, 2769, 2771, 2773, 2775, 2777, 2779, 2784, 2786, 2788, 2790, 2792, 
25 2794, 2796, 2798, 2800, 2802, 2804, 2806, 2808, 2810, 2812, 2814, 2816, 2818, 2820, 
2822, 2824, 2826, 2828, 2830, 2832, 2834, and 2836. In yet other preferred 
embodiments, the kits further comprise reagents for detecting a nucleic acid cleavage 
product. In further preferred embodiments, the reagents for detecting a cleavage product 
comprise oligonucleotides for use in a subsequent invasive cleavage reaction {See e.g., 
30 U.S. Patent No. 5,994,069). In particularly preferred embodiments, the reagents for the 
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subsequent invasive cleavage reaction comprise a probe labeled with moieties that 
produce a fluorescence resonance energy transfer (FRET) effect. 

The present invention also provides methods for treating nucleic acid, comprising: 
a) providing: a first structure-specific nuclease consisting of an endonuclease in a 
5 solution containing manganese; and a nucleic acid substrate; b) treating the nucleic acid 
substrate with increased temperature such that the substrate is substantially single- 
stranded; c) reducing the temperature under conditions such that the single-stranded 
substrate forms one or more cleavage structures; d) reacting the cleavage means with the 
cleavage structures so that one or more cleavage products are produced; and e) detecting 

10 the one or more cleavage products. In some embodiments of the methods, the 
endonuclease includes, but is not hmited to, CLEAVASE BN enzyme, Thermus 
aquaticus DNA polymerase, Thermus thermophilus DNA polymerase, Escherichia coli 
Exo III, and the Saccharomyces cerevisiae Radl/RadlO complex. In yet other preferred 
embodiments, the nuclease is a 5' nuclease derived fi-om a thermostable DNA polymerase 

1 5 altered in amino acid sequence such that it exhibits reduced DNA synthetic activity from 
that of the wild-type DNA polymerase but retains substantially the same 5* nuclease 
activity of the wild-type DNA polymerase. In yet other embodiments, the nucleic acid is 
selected from the group consisting of RNA and DNA. In further embodiments, the 
nucleic acid of step (a) is double stranded. 

20 The present invention also provides nucleic acid treatment kits, comprising: a) a 

composition comprising at least one purified FEN-1 endonuclease; and b) a solution 
containing manganese. In some embodiments of the kits, the purified FEN-1 
endonuclease is selected from the group consisting Pyrococcus woesei FEN-1 
endonuclease, Pyrococcus furiosus FEN-1 endonuclease, Methanococcus jannaschii 

25 FEN-1 endonuclease, Methanobacterium thermoautotrophicum FEN-1 endonuclease, 
Archaeoglobus fulgidus FEN-1, Sulfolobus solfataricus, Pyrobaculum aerophilum, 
Thermococcus litoralis, Archaeaglobus veneficus, Archaeaglobus profundus, Acidianus 
brierlyi, Acidianus ambivalens, Desulfurococcus amylolyticus, Desulfurococcus mobilis, 
Pyrodictium brockii, Thermococcus gorgonarius, Thermococcus zilligii, Methanopyrus 

30 kandleri, Methanococcus igneus, Pyrococcus horikoshii, Aeropyrum pernix, and 

chimerical FEN-1 endonucleases. In other embodiments, the kits fixrther comprise at 
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least one second structure-specific nuclease. In some preferred embodiments, the second 
nuclease is a 5' nuclease derived from a thermostable DNA polymerase altered in amino 
acid sequence such that it exhibits reduced DNA synthetic activity from that of the wild- 
type DNA polymerase but retains substantially the same 5* nuclease activity of the wild- 
type DNA polymerase. In yet other embodiments of the kits, the portion of the amino 
acid sequence of the second nuclease is homologous to a portion of the amino acid 
sequence of a thermostable DNA polymerase derived from a eubacterial thermophile of 
the genus Thermus. In further embodiments, the thermophile is selected from the group 
consisting of Thermus aquaticus, Thermus flavus ^d Thermus thermophilus. In yet other 
preferred embodiments, the kits further comprise reagents for detecting the cleavage 
products. 

The present invention further provides any of the compositions, mixtures, 
methods, and kits described herein, used in conjunction with endonucleases comprising 
Sulfolobus solfataricus, Pyrobaculum aerophilum, Thermococcus litoralis, 
Archaeaglobus veneficus, Archaeaglobus profundus, Acidianus brierfyi, Acidianus 
ambivalens, Desulfurococcus amylolyticus, Desulfurococcus mobilis, Pyrodictium 
brockiiy Thermococcus gorgonarius, Thermococcus zilligii, Methanopyrus kandleri, 
Methanococcus igneus, Pyrococcus horikoshii, md Aeropynim pernix endonucleases. 
These include compositions comprising purified FEN-1 endonucleases from the 
organisms (including specific endonucleases described by sequences provided herein, as 
well as, variants and homologues), kits comprising these compositions, composition 
comprising chimerical endonucleases comprising at least a portion of the endonucleases 
from these organisms, kits comprising such compositions, compositions comprising 
nucleic acids encoding the endonucleases from these organisms (including vectors and 
host cells), kits comprising such compositions, antibodies generated to the endonucleases, 
mixtures comprising endonucleases from these organisms, methods of using the 
endonuclease in cleavage assays (e.g., invasive cleavage assays, CFLP, etc.), and kits 
containing components useful for such methods. Examples describing the generation, 
structure, use, and characterization of these endonucleases are provided herein. 

The present invention also provides methods for improving the methods and 
enzymes disclosed herein. For example, the present invention provides methods of 
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improving enzymes for any intended purpose (e.g., use in cleavage reactions, 
amplification reactions, binding reactions, or any other use) comprising the step of 
providing an enzyme disclosed herein aad modifying the enzyme (e.g., altering the amino 
acid sequence, adding or subtracting sequence, adding post-translational modifications, 
5 adding any other component whether biological or not, or any other modification). 
Likewise, the present invention provides methods for improving the methods disclosed 
herein comprising, conducting the method steps with one or more changes (e.g., change 
in a composition provided in the method, change in the order of the steps, or addition or 
subtraction of steps). 

. 10 The improved performance in a detection assay may arise firom my one of, or a 

=J combination of several improved features. For example, in one embodiment, the enzyme 
p of the present invention may have an improved rate of cleavage (kcat) on a specific 
5 targeted structure, such that a larger amount of a cleavage product may be produced in a 
W given time span. In another embodiment, the enzyme of the present invention may have 
;|| 15 a reduced activity or rate in the cleavage of inappropriate or non-specific Structures. For 
'^^ example, in certain embodiments of the present invention, one aspect of improvement is 

f4 that the differential between the detectable amount of cleavage of a specific structure and 

f| 

PI the detectable amount of cleavage of any alternative structures is increased. As such, it is 

in 

within the scope of the present invention to provide an enzyme having a reduced rate of 

• *^ 

20 cleavage of a specific target structure compared to the rate of the native enzyme, and 
having a fiirther reduced rate of cleavage of any alternative structures, such that the 
differential between the detectable amount of cleavage of the specific structure and the 
detectable amount of cleavage of any altemative structures is increased. However, the 
present invention is not limited to enzymes that have an improved differential. 

25 In some preferred embodiments, the present invention provides a composition 

comprising an enzyme, wherein the enzyme comprises a heterologous functional domain, 
wherein the heterologous functional domain provides altered (e.g., improved) 
fimctionality in a nucleic acid cleavage assay. The present invention is not limited by the 
nature of the nucleic acid cleavage assay. For example, nucleic acid cleavage assays 

30 include any assay in which a nucleic acid is cleaved, directly or indirectly, in the presence 
of the enzyme. In certain preferred embodiments, the nucleic acid cleavage assay is an 
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invasive cleavage assay. In particularly prefeired embodiments, the cleavage assay 
utilizes a cleavage structure having at least one RNA component, hi another particularly 
preferred embodiment, the cleavage assay utihzes a cleavage structure having at least one 
RNA component, wherein a DNA member of the cleavage structure is cleaved. 

The present invention is not limited by the nature of the altered functionality 
provided by the heterologous functional domain. Illustrative examples of alterations 
include, but are not hmited to, enzymes where the heterologous functional domain 
comprises an amino acid sequence (e.g., one or more amino acids) that provides an 
improved nuclease activity, an improved substrate binding activity and/or improved 
background specificity in a nucleic acid cleavage assay. 

The present invention is not limited by the nature of the heterologous functional 
domain. For example, in some embodiments, the heterologous functional domain 
comprises two or more amino acids from a polymerase domain of a polymerase (e.g., 
introduced into the enzyme by insertion of a chimerical functional domain or created by 
mutation). In certain preferred embodiment, at least one of the two or more amino acids 
is from a palm or thumb region of the polymerase domain. The present invention is not 
limited by the identity of the polymerase from which the two or more amino acids are 
selected. In certain preferred embodiments, the polymerase comprises Thermus 
thermophilus polymerase. In particularly preferred embodiments, the two or more amino 
acids are from amino acids 300-650 of SEQ ID N0:1. 

The novel enzymes of the invention may be employed for the detection of target 
DNAs and RNAs including, but not limited to, target DNAs and RNAs comprismg wild 
type and mutant alleles of genes, including, but not hmited to, genes from humans, other 
animal, or plants that are or may be associated with disease or other conditions. In 
addition, the enzymes of the invention may be used for the detection of and/or 
identification of strains of microorganisms, including bacteria, fungi, protozoa, ciliates 
and viruses (and in particular for the detection and identification of viruses having RNA 
genomes, such as the Hepatitis C and Human Immunodeficiency viruses). For example, 
the present invention provides methods for cleaving a nucleic acid comprising providing: 
an enzyme of the present invention and a substrate nucleic acid; and exposing the 
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substrate nucleic acid to the enzyme (e.g., to produce a cleavage product that may be 
detected). In some embodiments, the substrate nucleic is in a cell lysate sample. 

The present invention also provides a method for detecting the presence of a 
target nucleic acid comprising: cleaving an invasive cleavage structure, said invasive 
5 cleavage structure comprising an RNA target nucleic acid; and detecting the cleavage of 
the invasive cleavage structure. Such an assay may comprise a multiplex assay, wherein 
multiple invasive cleavage structures are cleaved. Such structures include structures 
formed on different target nucleic acids, as well as, structures formed on different 
locations of the sample target nucleic acid. In some embodiments, the target nucleic acid 

' 10 comprises a first region and a second region, said second region downstream of and 
contiguous to said first region. In some embodiments, the invasive cleavage structure 
comprises the target nucleic acid, a first oUgonucleotide, and a second oligonucleotide, 
wherein at least a portion of the first oligonucleotide is completely complementary to the 

PI first portion of the first target nucleic acid, and wherein the second oUgonucleotide 

^ 15 comprises a 3* portion and a 5* portion, wherein the 5* portion is completely 

complementary to said second portion of the target nucleic acid. In some embodiments, 

111 the 3' portion of the second oligonucleotide comprises a 3' terminal nucleotide not 

complementary to said target nucleic acid. In some embodiments, the 3' portion of the 

0 second oligonucleotide consists of a single nucleotide not complementary to the target 

20 nucleic acid. In some embodiments, the method fiirther comprises the steps of forming a 
second invasive cleavage structure comprising a non-target cleavage product and 
cleaving the second invasive cleavage structure. In some embodiments, the invasive 
cleavage structure or the second invasive cleavage comprises an oligonucleotide 
comprising a sequence selected fi-om the group consisting of SEQ ID NO:709-2640. In 
25 other embodiments, the invasive cleavage structure or the second invasive cleavage 

comprises an oligonucleotide comprising a sequence selected from the group consisting 
of SEQ ID NO:169-21 1 and 619-706. In still other embodiments, the invasive cleavage 
structure or the second invasive cleavage comprises an oligonucleotide comprising a 
sequence selected fi'om the group consisting of SEQ ID NO:2837-4004. In some 
30 preferred embodiments, the target nucleic acid comprises a cytochrome P450 RNA or a 
cytokine RNA. In some embodiments, the 
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first region or the second region of the target nucleic acid encompasses a splice junction, 
an exon (or a portion thereof), or an intron (or a portion thereof). In some embodiments, 
the RNA target nucleic acid is provided in a cell lysate. In some embodiments, the fu:st 
oligonucleotide is covalently attached to the second oligonucleotide. Such 

5 oligonucleotides find use, for example, in methods described in U.S. Patent Nos. 

5,714,320 and 5,854,033, herein incorporated by reference in their entireties. The present 
invention also provides kits containing one or more of the components used in the above 
methods. 

10 DEFINITIONS 

To facilitate an understanding of the present invention, a number of terms and 

y phrases are defined below: 

Pl As used herein, the terms "complementary" or "complementarity" are used in 

3^ reference to polynucleotides (Le., a sequence of nucleotides such as an oligonucleotide or 
W 1 5 a target nucleic acid) related by the base-pairing rules. For example, for the sequence " 
^ 5'-A-G-T-3'," is complementary to the sequence " 3'-T-C-A-5\" Complementarity may be 

S| "partial," in which only some of the nucleic acids* bases are matched according to the 
rif base pairing rules. Or, there may be "complete" or "total" complementarity between the 
|n nucleic acids. The degree of complementarity between nucleic acid strands has 

' 20 significant effects on the efficiency and strength of hybridization between nucleic acid 
strands. This is of particular importance in amplification reactions, as well as detection 
methods that depend upon binding between nucleic acids. Either term may also be used 
in reference to individual nucleotides, especially within the context of polynucleotides. 
For example, a particular nucleotide within an oligonucleotide may be noted for its 
25 complementarity, or lack thereof, to a nucleotide within another nucleic acid strand, in 
contrast or comparison to the complementarity between the rest of the oligonucleotide 
and the nucleic acid strand. Nucleotide analogs used to form non-standard base pairs, 
whether with another nucleotide analog (e.g., an IsoC/IsoG base pair), or with a naturally 
occurring nucleotide (e.g., as described in U.S. Patent 5,912,340, herein incorporated by 
30 reference in its entirety) are also considered to be complementary to a base pairing 

partner within the meaning this definition. Further, when nucleotides are known to form 
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pairs with multiple different bases, e.g., IsoG nucleotide ability to pair with IsoC and with 
T nucleotides, each of the bases with which it can form a hydrogen-bonded base-pair falls 
within the meaning of "complementary," as used herein. "Universal" bases, i.e., those 
that can form base pairs with several other bases, such as the "wobble"base inosine, are 

5 considered complementary to those bases with which pairs can be formed. 

The term "homology" and "homologous" refers to a degree of identity. There 
may be partial homology or complete homology. A partially homologous sequence is 
one that is less than 100% identical to another sequence. 

As used herein, the term "hybridization" is used in reference to the pairing of 

1 0 complementary nucleic acids. Hybridization and the strength of hybridization (i. e. , the 
strength of the association between the nucleic acids) is influenced by such factors as the 
degree of complementary between the nucleic acids, stringency of the conditions 
involved, and the T^ of the formed hybrid. "Hybridization" methods involve the 
annealing of one nucleic acid to another, complementary nucleic acid, i.e., a nucleic acid 

1 5 having a complementary nucleotide sequence. The ability of two polymers of nucleic 
acid containing complementary sequences to find each other and anneal through base 
pairing interaction is a well-recognized phenomenon. The initial observations of the 
"hybridization" process by Mamiur and Lane, Proc. Natl. Acad. Sci. USA 46:453 (1960) 
and Doty et al., Proc. Natl. Acad. Sci. USA 46:461 (1960) have been followed by the 

20 refinement of this process into an essential tool of modem biology. 

With regard to complementarity, it is important for some diagnostic appUcations 
to determine whether the hybridization represents complete or partial complementarity. 
For example, where it is desired to detect simply the presence or absence of pathogen 
DNA (such as fi"om a virus, bacterium, fungi, mycoplasma, protozoan) it is only 

25 important that the hybridization method ensures hybridization when the relevant 

sequence is present; conditions can be selected where both partially complementary 
probes and completely complementary probes will hybridize. Other diagnostic 
applications, however, may require that the hybridization method distinguish between 
psutial and complete complementarity. It may be of interest to detect genetic 

30 polymorphisms. For example, human hemoglobin is composed, in part, of four 

polypeptide chains. Two of these chains are identical chains of 141 amino acids (alpha 
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chains) and two of these chains are identical chains of 146 amino acids (beta chains). 
The gene encoding the beta chain is known to exhibit polymorphism. The normal allele 
encodes a beta chain having glutamic acid at the sixth position. The mutant allele 
encodes a beta chain having valine at the sixth position. This difference in amino acids 
has a profound (most profound when the individual is homozygous for the mutant allele) 
physiological impact known clinically as sickle cell anemia. It is well known that the 
genetic basis of the amino acid change involves a single base difference between the 
normal allele DNA sequence and the mutant allele DNA sequence. 

The complement of a nucleic acid sequence as xxsed herein refers to an 
oHgonucleotide which, when aligned with the nucleic acid sequence such that the 5' end 
of one sequence is paired with the 3' end of the other, is in "antiparallel association." 
Certain bases not commonly found in natural nucleic acids may be included in the nucleic 
acids of the present invention and include, for example, inosine and 7-deazaguanine. 
Complementarity need not be perfect; stable duplexes may contain mismatched base 
pairs or unmatched bases. Those skilled in the art of nucleic acid technology can 
determine duplex stability empirically considering a number of variables including, for 
example, the length of the oUgonucleotide, base composition and sequence of the 
oligonucleotide, ionic strength and incidence of mismatched base pairs. 

As used herein, the term " Tm " is used in reference to the "melting temperature." 
The melting temperature is the temperature at which a population of double-stranded 
nucleic acid molecules becomes half dissociated into single strands. Several equations 
for calculating the T^ of nucleic acids are well known in the art. As indicated by 
standard references, a simple estimate of the Tm value may be calculated by the equation: 
Tm = 81.5 + 0,41 (% G + C), when a nucleic acid is in aqueous solution at 1 M NaCl {see 
e.g., Anderson and Young, Quantitative Filter Hybridization, in Nucleic Acid 
Hybridization (1985). Other references (e.g., Allawi, H.T. & SantaLucia, J., Jr. 
Thermodynamics and NMR of internal G.T mismatches in DNA. Biochemistry 36, 
10581-94 (1997) include more sophisticated computations which take structural and 
environmental, as well as sequence characteristics into account for the calculation of Tm- 

As used herein the term "stringency" is used in reference to the conditions of 
temperature, ionic strength, and the presence of other compounds, under which nucleic 
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acid hybridizations are conducted. With "high stringency" conditions, nucleic acid base 
pairing will occur only between nucleic acid fragments that have a high frequency of 
complementary base sequences. Thus, conditions of "weak" or "low" stringency are 
often required when it is desired that nucleic acids that are not completely 
complementary to one another be hybridized or annealed together. 

"High stringency conditions" when used in reference to nucleic acid hybridization 
comprise conditions equivalent to binding or hybridization at 42 C in a solution 
consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 NaH2P04 H2O and 1.85 g/1 EDTA, pH 
adjusted to 7.4 with NaOH), 0.5% SDS, 5X Denhardf s reagent and 100 ^ig/ml denatured 
salmon sperm DNA followed by washing in a solution comprising O.IX SSPE, 1.0% SDS 
at 42 C when a probe of about 500 nucleotides in length is employed. 

"Medium stringency conditions" when used in reference to nucleic acid 
hybridization comprise conditions equivalent to binding or hybridization at 42 C in a 
solution consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/l NaH2P04 H2O and 1.85 g/1 
EDTA, pH adjusted to 7,4 with NaOH), 0.5% SDS, 5X Denhardt's reagent and 100 ^g/ml 
denatured salmon sperm DNA followed by washing in a solution comprising l.OX SSPE, 
1.0% SDS at 42 C when a probe of about 500 nucleotides in length is employed. 

"Low stringency conditions" comprise conditions equivalent to binding or 
hybridization at 42 C in a solution consisting of 5X SSPE (43.8 g/1 NaCl, 6.9 g/1 
NaH2P04 H2O and 1.85 g/1 EDTA, pH adjusted to 7.4 with NaOH), 0.1% SDS, 5X 

Denhardt's reagent [50X Denhardt's contains per 500 ml: 5 g FicoU (Type 400, 
Pharamcia), 5 g BSA (Fraction V; Sigma)] and 100 g/ml denatured salmon sperm DNA 
followed by washing in a solution comprising 5X SSPE, 0.1% SDS at 42 C when a probe 
of about 500 nucleotides in length is employed. 

The term "gene" refers to a DNA sequence that comprises control and coding 
sequences necessary for the production of an RNA having a non-coding function (e.g., a 
ribosomal or transfer RNA), a polypeptide or a precursor. The RNA or polypeptide can 
be encoded by a full-length coding sequence or by any portion of the coding sequence so 
long as the desired activity or function is retained. 

The term "wild-type" refers to a gene or a gene product that has the characteristics 
of that gene or gene product when isolated from a naturally occurring source. A wild- 
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type gene is that which is most frequently observed in a population and is thus arbitrarily 
designated the "normal" or "wild-type" form of the gene. In contrast, the term 
"modified,""mutant," or "polymorphic" refers to a gene or gene product that displays 
modifications in sequence and or functional properties (i.e., altered characteristics) when 
5 compared to the wild-type gene or gene product. It is noted that naturally-occurring 
mutants can be isolated; these are identified by the fact that they have altered 
characteristics when compared to the wild-type gene or gene product. 

The term "recombinant DNA vector" as used herein refers to DNA sequences 
containing a desired heterologous sequence. For example, although Ihe term is not 
10 limited to the use of expressed sequences or sequences that encode an expression product, 
2 in some embodiments, the heterologous sequence is a coding sequence and appropriate 

O DNA sequences necessary for either the replication of the coding sequence in a host 

HI 

^ organism, or the expression of the operably linked coding sequence in a particular host 
^ organism. DNA sequences necessary for expression in prokaryotes include a promoter, 
15 optionally an operator sequence, a ribosome-binding site and possibly other sequences. 
Eukaryotic cells are known to utihze promoters, polyadenlyation signals and enhancers. 
n The term "LTR" as used herein refers to the long terminal repeat found at each 

g end of a proviras (i.e. , the integrated form of a retrovirus). The LTR contams numerous 
2 regulatory signals including transcriptional control elements, polyadenylation signals and 
20 sequences needed for repUcation and integration of the viral genome. The viral LTR is 
divided into three regions called U3, R and U5. 

The U3 region contains the enhancer and promoter elements. The U5 region 
contains the polyadenylation signals. The R (repeat) region separates the U3 and U5 
regions and transcribed sequences of the R region appear at both the 5' and 3* ends of the 
25 viral RNA. 

The term "oligonucleotide" as used herein is defined as a molecule comprising 
two or more deoxyribonucleotides or ribonucleotides, preferably at least 5 nucleotides, 
more preferably at least about 10-15 nucleotides and more preferably at least about 15 to 
30 nucleotides. The exact size will depend on many factors, which in tum depend on the 
30 ultimate function or use of the oligonucleotide. The oligonucleotide may be generated in 
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any maimer, including chemical synthesis, DNA repUcation, reverse transcription, PGR, 
or a combination thereof. 

Because mononucleotides are reacted to make oligonucleotides in a manner such 
that the 5' phosphate of one mononucleotide pentose ring is attached to the 3' oxygen of 
its neighbor in one direction via a phosphodiester linkage, an end of an oUgonucleotide is 
referred to as the "5' end" if its 5' phosphate is not linked to the 3' oxygen of a 
mononucleotide pentose ring and as the "3' end" if its 3' oxygen is not linked to a 5' 
phosphate of a subsequent mononucleotide pentose ring. As used herein, a nucleic acid 
sequence, even if internal to a larger oUgonucleotide, also may be said to have 5' and 3* 
ends. A first region along a nucleic acid strand is said to be upstream of another region if 
the 3' end of the first region is before the 5' end of the second region when moving along 
a strand of nucleic acid in a 5* to 3' direction. 

When two different, non-overlapping oligonucleotides anneal to different regions 
of the same linear complementary nucleic acid sequence, and the 3' end of one 
oUgonucleotide points towards the 5' end of the other, the former may be called the 
"upstream" oUgonucleotide and the latter the "downstream" oUgonucleotide. Similarly, 
when two overlapping oligonucleotides are hybridized to the same Unear complementary 
nucleic acid sequence, with the first oUgonucleotide positioned such that its 5' end is 
upstream of the 5' end of the second oligonucleotide, and the 3' end of the first 
oligonucleotide is upstreatn of the 3' end of the second oligonucleotide, the first 
oUgonucleotide may be called the "upstream" oUgonucleotide and the second 
oligonucleotide maybe called the "downstream" oligonucleotide. 

The term "primer" refers to an oUgonucleotide that is capable of acting as a point 
of initiation of synthesis when placed under conditions in which primer extension is 
initiated. An oligonucleotide "primer" may occur naturally, as in a purified restriction 
digest or may be produced synthetically. 

A primer is selected to be "substantially" complementary to a strand of specific 
sequence of the template. A primer must be sufficiently complementary to hybridize 
with a template strand for primer elongation to occur, A primer sequence need not reflect 
the exact sequence of the template. For example, a non-complementary nucleotide 
fi-agment may be attached to the 5' end of the primer, with the remainder of the primer 
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sequence being substantially complementary to the strand. Non-complementary bases or 
longer sequences can be interspersed into the primer, provided that the primer sequence 
has sufficient complementarity with the sequence of the template to hybridize and 
r thereby form a template primer complex for synthesis of the extension product of the 
5 primer. 

The term "label" as used herein refers to any atom or molecule that can be used to 
provide a detectable (preferably quantifiable) effect, and that cm be attached to a nucleic 
acid or protein. Labels include but are not limited to dyes; radiolabels such as ^^P; 
binding moieties such as biotin; haptens such as digoxgenin; luminogenic, 
10 phosphorescent or fluorogenic moieties; and fluorescent dyes alone or in combination 
ti. with moieties that can suppress or shift emission spectra by fluorescence resonance 
O energy transfer (FRET). Labels may provide signals detectable by fluorescence, 
HI radioactivity, colorimetry, gravimetry, X-ray diffraction or absorption, magnetism, 
m enzymatic activity, and the like. A label may be a charged moiety (positive or negative 
15 charge) or alternatively, may be charge neutral. Labels can include or consist of nucleic 
H acid or protein sequence, so long as the sequence comprising the label is detectable. 

The term "signal" as used herein refers to any detectable effect, such as would be 
^ f caused or provided by a label or an assay reaction. 

p As used herein, the term "detector" refers to a system or component of a system, 

' 20 e.g., an instrument (e.g. a camera, fluorimeter, charge-coupled device, scintillation 

counter, etc.) or a reactive medium (X-ray or camera fihn, pH indicator, etc.), that can 
convey to a user or to another component of a system (e.g., a computer or controller) the 
presence of a signal or effect. A detector can be a photometric or spectrophotometric 
system, which can detect ultraviolet, visible or infrared light, including fluorescence or 
25 chemiluminescence; a radiation detection system; a spectroscopic system such as nuclear 
magnetic resonance spectroscopy, mass spectrometry or surface enhanced Raman 
spectrometry; a system such as gel or capillary electrophoresis or gel exclusion 
chromatography; or other detection systems known in the art, or combinations thereof 
The term "cleavage structure" as used herein, refers to a structure that is formed 
30 by the interaction of at least one probe oUgonucleotide and a target nucleic acid, forming 
a structure comprising a duplex, the resulting structure being cleavable by a cleavage 
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agent, including but not limited to an enzyme. The cleavage structure is a substrate for 
specific cleavage by the cleavage means in contrast to a nucleic acid molecule that is a 
substrate for non-specific cleavage by agents such as phosphodiesterases that cleave 
nucleic acid molecules without regard to secondary structure {Le., no formation of a 
5 duplexed structure is required). 

The term "folded cleavage structure" as used herein, refers to a region of a single- 
stranded nucleic acid substrate containing secondary structure, the region being cleavable 
by an enzymatic cleavage means. The cleavage structure is a substrate for specific 
cleavage by the cleavage means in contrast to a nucleic acid molecule that is a substrate 
10 for non-specific cleavage by agents such as phosphodiesterases that cleave nucleic acid 
molecules without regard to secondary structure (i.e., no folding of the substrate is 
|S required). 

As used herein, the term "folded target" refers to a nucleic acid strand that 
CI contains at least one region of secondary structure (i.e., at least one double stranded 
'fj 1 5 region and at least one single-stranded region within a single strand of the nucleic acid). 

1 A folded target may comprise regions of tertiary structure in addition to regions of 

Q 

f^, secondary structure. 

The term "cleavage means" or "cleavage agent" as used herein refers to any 
g means that is capable of cleaving a cleavage structure, including but not limited to 
20 enzymes. The cleavage means may include native DNAPs having 5' nuclease activity 
(e.g., Taq DNA polymerase, E. coli DNA polymerase I) and, more specifically, modified 
DNAPs having 5* nuclease but lacking synthetic activity. "Structure-specific nucleases" 
or "structure-specific enzymes" are enzymes that recognize specific secondary structures 
in a nucleic acid molecule and cleave these structures. The cleavage means of the 
25 invention cleave a nucleic acid molecule in response to the formation of cleavage 

structures; it is not necessary that the cleavage means cleave the cleavage structure at any 
particular location within the cleavage structure. 

The cleavage means is not restricted to enzymes having solely 5' nuclease 
activity. The cleavage means may include nuclease activity provided from a vmety of 
30 sources including the CLEAVASE enzymes, the FEN-1 endonucleases (including RAD2 
and XPG proteins), Taq DNA polymerase and E. coli DNA polymerase 1. 
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The term "thennostable" when used in reference to an enzyme, such as a 5' 
nuclease, indicates that the enzyme is functional or active (i.e., can perform catalysis) at 
an elevated temperature, i.e., at about SS^'C or higher. 

The term "cleavage products" as used herein, refers to products generated by the 
reaction of a cleavage means with a cleavage structure (i.e., the treatment of a cleavage 
structure with a cleavage means). 

The term "target nucleic acid" refers to a nucleic acid molecule containing a 
sequence that has at least partial complementarity with at least a probe oligonucleotide 
and may also have at least partial complementarity with an INVADER oUgonucleotide. 
The target nucleic acid may comprise single- or double-stranded DNA or RNA, and may 
comprise nucleotide analogs, labels, and other modijBcations. 

The term "probe oUgonucleotide" refers to an oligonucleotide that interacts with a 
target nucleic acid to form a cleavage structure in the presence or absence of an 
INVADER oligonucleotide. When annealed to the target nucleic acid, the probe 
oligonucleotide and target form a cleavage structure and cleavage occurs within the probe 
oligonucleotide. 

The term "non-target cleavage product" refers to a product of a cleavage reaction 
that is not derived from the target nucleic acid. As discussed above, in the methods of the 
present invention, cleavage of the cleavage structure generally occurs within the probe 
oligonucleotide. The fragments of the probe oligonucleotide generated by this target 
nucleic acid-dependent cleavage are "non-target cleavage products." 

The term "INVADER oUgonucleotide" refers to an oUgonucleotide that 
hybridizes to a target nucleic acid at a location near the region of hybridization between a 
probe and the target nucleic acid, wherein the INVADER oligonucleotide comprises a 
portion (e.g., a chemical moiety, or nucleotide — ^whether complementary to that target or 
not) that overlaps with the region of hybridization between the probe and target. In some 
embodiments, the INVADER oUgonucleotide contains sequences at its 3' end that are 
substantially the same as sequences located at the 5' end of a probe oligonucleotide. 

The term "substantially single-stranded" when used in reference to a nucleic acid 
substrate means that the substrate molecule exists primarily as a single strand of nucleic 
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acid in contrast to a double-stranded substrate which exists as two strands of nucleic acid 
which are held together by inter-strand base pairing interactions. 

The term "sequence variation" as used herein refers to differences in nucleic acid 
sequence between two nucleic acids. For example, a wild-type structural gene and a 
5 mutant form of this wild-type structural gene may vary in sequence by the presence of 
single base substitutions and/or deletions or insertions of one or more nucleotides. These 
two forms of the structural gene are said to vary in sequence from one another. A second 
mutant form of the structural gene may exist. This second mutant form is said to vary in 
sequence from both the wild-type gene and the first mutant form of the gene. 
10 The term "liberating" as used herein refers to the release of a nucleic acid 

Ni; fragment from a larger nucleic acid fragment, such as an oUgonucleotide, by the action 

n 

S of, for example, a 5' nuclease such that the released fragment is no longer covalently 
W attached to the remainder of the oligonucleotide. 

m The term "K^n" as used herein refers to the Michaelis-Menten constant for an 

1 5 enzyme and is defined as the concentration of the specific substrate at which a given 
enzyme yields one-half its maximum velocity in an enzyme catalyzed reaction. 
m The term "nucleotide" as used herein includes, but is not Umited to, naturally 

]^ occurring and/or synthetic nucleotides, nucleotide analogs, and nucleotide derivatives. 

CI For example, the term includes naturally occurring DNA or RNA monomers, nucleotides 
20 with backbone modifications such as peptide nucleic acid (PNA) (M. Egholm et al., 
Nature 365:566 [1993]), phosphorothioate DNA, phosphorodithioate DNA, 
phosphoramidate DNA, aminde-linked DNA, MMI-linked DNA, 2'-0-methyl RNA, 
alpha-DNA and methylphosphonate DNA, nucleotides with sugar modifications such as 
2'-0-methyl RNA, 2'-fluoro RNA, 2'-amino RNA, 2'-0-alkyl DNA, 2'-0-allyl DNA, 
25 2'-0-alkynyl DNA, hexose DNA, pyranosyl RNA, and anhydrohexitol DNA, and 

nucleotides having base modifications such as C-5 substituted pyrimidines (substituents 
including fluoro-, bromo- chloro-, iodo-, methyl-, ethyl-, vinyl-, formyl-, ethynyl-, 
propynyl-, alkynyl-, thiazoyl-, imidazolyl-, pyridyl-), 7-deazapurines with C-7 
substituents including fluoro-, bromo-, chloro-, iodo-, methyl-, ethyl-, vinyl-, formyl-, 
30 alkynyl-, alkenyl-, thiazolyl-, imidazolyl-, pyridyl-), inosine and diaminopurine. 
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The tenn "base analog" as used herein refers to modified or non-naturally 
occurring bases such as 7-deaza purines (e.g., 7-deaza-adenine and 7-deaza-guanine); 
bases modified, for example, to provide altered interactions such as non-standard 
basepairing, including, but not limited to: IsoC, Iso G, and other modified bases and 
5 nucleotides described in U.S. Patent Nos. 5,432,272; 6,001,983; 6,037,120; 6,140,496; 
5,912,340; 6,127,121 and 6,143,877, each of which is incorporated herein by reference in 
their entireties; heterocychc base analogs based no the purine or pyrimidine ring systems, 
and other heterocyclic bases. Nucleotide analogs include base analogs and comprise 
modified forms of deoxyribonucleotides as well as ribonucleotides. 
10 The term "polymorphic locus" is a locus present in a population that shows 

variation between members of the population (e.g.., the most common allele has a 
frequency of less than 0.95). hi contrast, a "monomorphic locus" is a genetic locus at 
S[ little or no variations seen between members of the population (generally taken to be a 
locus at which the most common allele exceeds a frequency of 0.95 in the gene pool of 

El 

Id 15 the population). 
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The term "microorganism" as used herem means an organism too small to be 



|3 observed with the unaided eye aad includes, but is not limited to bacteria, virus, 

n 

|i| protozoans, fungi, and ciUates. 

^ The term "microbial gene sequences" refers to gene sequences derived from a 

ri 20 microorganism. 

The term "bacteria" refers to any bacterial species including eubacterial and 
archaebacterial species. 

The term "virus" refers to obligate, ultramicroscopic, intracellular parasites 
incapable of autonomous replication (i.e., replication requires the use of the host cell's 
25 machinery). 

The term "multi-drug resistant" or multiple-drug resistant" refers to a 
microorgmiism that is resistant to more than one of the antibiotics or antimicrobial agents 
used in the treatment of said microorganism. 

The term "sample" in the present specification and claims is used in its broadest 
30 sense. On the one hand it is meant to include a specimen or culture (e.g., microbiological 
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cultures). On the other hand, it is meant to include both biological and environmental 
samples. A sample may include a specimen of synthetic origin. 

Biological samples may be animal, including human, fluid, solid (eg., stool) or 
tissue, as well as liquid and solid food and feed products and ingredients such as dairy 

5 items, vegetables, meat and meat by-products, and waste. Biological samples may be 
obtained from all of the various families of domestic animals, as well as feral or wild 
animals, including, but not limited to, such animals as ungulates, bear, fish, lagamorphs, 
rodents, etc. 

Environmental samples include environmental material such as surface matter, 
10 soil, water and industrial samples, as well as samples obtained from food and dairy 

u processing instruments, apparatus, equipment, utensils, disposable and non-disposable 
y items. These examples are not to be construed as limiting the sample types applicable to 
pf the present invention. 

2 The term "source of target nucleic acid" refers to any sample that contains nucleic 

W 15 acids (RNA or DNA). Particularly preferred sources of target nucleic acids are biological 
n samples including, but not limited to blood, saliva, cerebral spinal fluid, pleural fluid, 

J5 milk, lymph, sputum md semen, 

t -if 

ri An ohgonucleotide is said to be present in "excess" relative to another 

jn oligonucleotide (or target nucleic acid sequence) if that oligonucleotide is present at a 

' 20 higher mol^ concentration that the other oligonucleotide (or target nucleic acid 

sequence). When an oligonucleotide such as a probe oligonucleotide is present in a 
cleavage reaction in excess relative to the concentration of the complementary target 
nucleic acid sequence, the reaction may be used to indicate the amount of the target 
nucleic acid present. Typically, when present in excess, the probe oligonucleotide will be 
25 present at least a 100-fold molar excess; typically at least 1 pmole of each probe 

oligonucleotide would be used when the target nucleic acid sequence was present at 
about 10 finoles or less. 

A sample "suspected of containing" a first and a second target nucleic acid may 
contain either, both or neither target nucleic acid molecule. 
30 The term "charge-balanced" oligonucleotide refers to an ohgonucleotide (the 

input ohgonucleotide in a reaction) that has been modified such that the modified 
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oligonucleotide bears a charge, such that when the modified oUgonucleotide is either 
cleaved (i.e., shortened) or elongated, a resulting product bears a charge different frona 
the input oUgonucleotide (the "charge-unbalanced" oUgonucleotide) thereby permitting 
separation of the input and reacted oHgonucleotides on the basis of charge. The term 
5 "charge-balanced" does not imply that the modified or balanced oligonucleotide has a net 
neutral charge (although this can be the case). Charge-balancing refers to the design and 
modification of an oUgonucleotide such that a specific reaction product generated firom 
this input oUgonucleotide can be separated on the basis of charge fi-om the input 
oligonucleotide. 

1 0 For example, in an INVADER oUgonucleotide-directed cleavage assay in which 

the probe oligonucleotide bears the sequence: 5' TTCTTTTCACCAGCGAGACGGG 3' 
p (ie., SEQ ID NO: 136 without the modified bases) and cleavage of the probe occurs 
2 between the second and third residues, one possible charge-balanced version of this 
P oUgonucleotide would be: 5' Cy3-AminoT-Amino-TCTTTTCACCAGCGAGAC GGG 
y 15 3'. This modified oligonucleotide bears a net negative charge. After cleavage, the 

following oligonucleotides are generated: 5' Cy3-AminoT-Amino-T 3'and 5' 
p CTTTTC ACCAGCGAGACGGG 3^ (residues 3-22of SEQ ID NO: 1 36). 5' Cy3- 
ill AminoT-Amino-T 3' bears a detectable moiety (the positively-charged Cy3 dye) and two 

amino-modified bases. The amino-modified bases and the Cy3 dye contribute positive 
f| 20 charges in excess of the negative charges contributed by the phosphate groups and thus 
the 5' Cy3-AminoT-Amino-T 3 'oUgonucleotide has a net positive charge. The other, 
longer cleavage fi-agment, like the input probe, bears a net negative charge. Because the 
5' Cy3-AminoT-Amino-T S'firagment is separable on the basis of charge fi-om the input 
probe (the charge-balanced oUgonucleotide), it is referred to as a charge-unbalanced 
25 oligonucleotide. The longer cleavage product cannot be separated on the basis of charge 
from the input oligonucleotide as both oUgonucleotides bear a net negative charge; thus, 
the longer cleavage product is not a charge-unbalanced oUgonucleotide. 

The term "net neutral charge" when used in reference to an oligonucleotide, 
including modified oUgonucleotides, indicates that the sum of the charges present (i.e., R- 
30 NH3+ groups on thymidines, the N3 nitrogen of cytosine, presence or absence or 

phosphate groups, etc.) under the desired reaction or separation conditions is essentially 
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zero. An oligonucleotide having a net neutral charge would not migrate in an electrical 
field. 

The term "net positive charge" when used in reference to an oUgonucleotide, 
including modified oligonucleotides, indicates that the sum of the charges present (i.e., R- 
5 NH3+ groups on thymidines, the N3 nitrogen of cytosine, presence or absence or 
phosphate groups, etc.) under the desired reaction conditions is +1 or greater. An 
oligonucleotide having a net positive charge would migrate toward the negative electrode 
in an electrical field. 

The term "net negative charge" when used in reference to an oligonucleotide, 
10 including modified oligonucleotides, indicates that the sum of the charges present (i.e., R- 
NH3+ groups on thymidines, the N3 nitrogen of cytosine, presence or absence or 
phosphate groups, etc.) under the desired reaction conditions is -1 or lower. An 
oligonucleotide having a net negative charge would migrate toward the positive electrode 
in an electrical field. 

15 The term "polymerization means" or "polymerization agent" refers to any agent 

capable of facilitating the addition of nucleoside triphosphates to an oligonucleotide. 

Preferred polymerization means comprise DNA and RNA polymerases. 

The term "Ugation means" or "ligation agent" refers to any agent capable of 

facilitating the ligation (i.e., the formation of aphosphodiester bond between a 3'-0H and 
20 a 5' P located at the termini of two strands of nucleic acid). Preferred Ugation means 

comprise DNA Hgases and RNA Ugases. 

The term "reactant" is used herein in its broadest sense. The reactant can 

comprise, for example, an enzymatic reactant, a chemical reactant or light (e.g., 

ultraviolet hght, particularly short wavelength ultraviolet Hght is known to break 
25 oligonucleotide chains). Any agent capable of reacting with an oligonucleotide to either 

shorten (i.e., cleave) or elongate the oligonucleotide is encompassed within the term 

"reactant." 

The term "adduct" is used herein in its broadest sense to indicate any compound 
or element that can be added to an oligonucleotide. An adduct may be charged 
30 (positively or negatively) or may be charge-neutral. An adduct may be added to the 

oligonucleotide via covalent or non-covalent linkages. Examples of adducts include, but 
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are not limited to, indodicarbocyanine dye amidites, amino-substituted nucleotides, 
ethidium bromide, ethidium homodimer, (l,3-propanediamino)propidium, 
(diethylenetriamino)propidium, thiazole orange, (N-N-tetramethyl-1,3- 
propanediamino)propyl thiazole orange, (N-N*-tetramethyl-l,2-ethanediamino)propyl 
thiazole orange, thiazole orange-thiazole orange homodimer (TOTO), thiazole orange- 
thiazole blue heterodimer (TOTAB), thiazole orange-ethidium heterodimer 1 (TOEDl), 
thiazole orange-ethidium heterodimer 2 (T0ED2) and fluorescein-ethidium heterodimer 
(FED), psoralens, biotin, streptavidin, avidin, etc. 

Where a first oligonucleotide is complementary to a region of a target nucleic acid 
and a second oligonucleotide has complementary to the same region (or a portion of this 
region) a "region of sequence overlap" exists along the target nucleic acid. The degree of 
overlap v^ill vary depending upon the nature of the complementarity (see, e.g., region "X" 
in Figs. 29 and 67 and the accompanying discussions). 

As used herein, the term "purified" or "to purify" refers to the removal of 
contaminants from a sample. For example, recombinant CLEAVASE nucleases are 
expressed in bacterial host cells and the nucleases are purified by the removal of host cell 
proteins; the percent of these recombinant nucleases is thereby increased in the sample. 

The term "recombinant DNA molecule" as used herein refers to a DNA molecule 
that comprises of segments of DNA joined together by means of molecular biological 
techniques. 

The term "recombinant protein" or "recombinant polypeptide" as used herein 
refers to a protein molecule that is expressed from a recombinant DNA molecule. 

As used herein the term "portion" when in reference to a protein (as in "a portion 
of a given protein") refers to fragments of that protein. The fragments may range in size 
from four amino acid residues to the entire amino acid sequence minus one amino acid 
(e.g., 4, 5, 6, . . n-1). 

The term "nucleic acid sequence" as used herein refers to an oligonucleotide, 
nucleotide or polynucleotide, and fragments or portions thereof, and to DNA or RNA of 
genomic or synthetic origin that may be single or double stranded, and represent the 
sense or antisense strand. Similarly, "amino acid sequence" as used herein refers to 
peptide or protein sequence. 
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The term "peptide nucleic acid" ("PNA") as used herein refers to a molecule 
comprising bases or base analogs such as would be found in natural nucleic acid, but 
attached to a peptide backbone rather than the sugar-phosphate backbone typical of 
nucleic acids. The attachment of the bases to the peptide is such as to allow the bases to 
base pair with complementary bases of nucleic acid in a manner similar to that of an 
oligonucleotide. These small molecules, also designated anti gene agents, stop transcript 
elongation by binding to their complementary strand of nucleic acid (Nielsen, et al 
Anticancer Drug Des. 8:53 63 [1993]). 

As used herein, the terms "purified" or "substantially purified" refer to molecules, 
either nucleic or amino acid sequences, that are removed from their natural environment, 
isolated or separated, and are at least 60% free, preferably 75% free, and most preferably 
90% free from other components with which they are naturally associated. An "isolated 
polynucleotide" or "isolated oligonucleotide" is therefore a substantially purified 
polynucleotide. 

As used herein, the term "ftision protein" refers to a chimeric protein 
containing the protein of interest (e.g., CLEAVASE BN/thrombin nuclease and portions 
or fragments thereof) joined to an exogenous protein fragment (the ftision partner which 
consists of a non CLEAVASE BN/thrombin nuclease protein). The fusion partner may 
enhance solubility of recombinant chimeric protein (e.g., the CLEAVASE BN/thrombin 
nuclease) as expressed in a host cell, may provide an affinity tag (e.g., a his-tag) to allow 
purification of the recombinant fiision protein from the host cell or culture supematant, or 
both. If desired, the fusion protein may be removed from the protein of interest (e.g., 
CLEAVASE BN/thrombin nuclease or fragments thereof) by a variety of enzymatic or 
chemical means known to the art. 

As used herein, the terms "chimeric protein" and "chimerical protein" refer to a 
single protein molecule that comprises amino acid sequences portions derived from two 
or more parent proteins. These parent molecules may be from similar proteins from 
genetically distinct origins, different proteins from a single organism, or different 
proteins from different organisms. By way of example but not by way of limitation, a 
chimeric structure-specific nuclease of the present invention may contain a mixture of 
amino acid sequences that have been derived from FEN-1 genes from two or more of the 
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organisms having such genes, combined to fomi a non-naturally occurring nuclease. The 
term "chimerical" as used herein is not intended to convey any particular proportion of 
contribution from the naturally occurring genes, nor limit the manner in which the 
portions are combined. Any chimeric structure-specific nuclease constructs having 
cleavage activity as determined by the testing methods described herein are improved 
cleavage agents within the scope of the present invention. 

The term "continuous strand of nucleic acid" as used herein is means a strand of 
nucleic acid that has a continuous, covalently linked, backbone structure, without nicks or 
other disruptions. The disposition of the base portion of each nucleotide, whether 
base-paired, single-stranded or mismatched, is not an element in the definition of a 
continuous strand. The backbone of the continuous strand is not limited to the 
ribose-phosphate or deoxyribose-phosphate compositions that are found in naturally 
occurring, unmodified nucleic acids. A nucleic acid of the present invention may 
comprise modifications in the structure of the backbone, including but not limited to 
phosphorothioate residues, phosphonate residues, T substituted ribose residues {e.g,, 
2'-0-methyl ribose) and altemative sugar (e.g., arabinose) containing residues. 

The term "continuous duplex" as used herein refers to a region of double stranded 
nucleic acid in which there is no disruption in the progression of basepairs within the 
duplex (i.e., the base pairs along the duplex are not distorted to accommodate a gap, 
bulge or mismatch with the confines of the region of continuous duplex). As used herein 
the term refers only to the arrangement of the basepairs within the duplex, without 
impUcation of continuity in the backbone portion of the nucleic acid strand. Duplex 
nucleic acids with uninterrupted basepairing, but with nicks in one or both strands are 
within the definition of a continuous duplex. 

The term "duplex" refers to the state of nucleic acids in which the base portions of 
the nucleotides on one strand are bound through hydrogen bonding the their 
complementary bases arrayed on a second strand. The condition of being in a duplex 
form reflects on the state of the bases of a nucleic acid. By virtue of base pairing, the 
strands of nucleic acid also generally assume the tertiary structure of a double helix, 
having a major and a minor groove. The assumption of the heUcal form is implicit in the 
act of becoming duplexed. 
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The term "duplex dependent protein binding" refers to the binding of proteins to 
nucleic acid that is dependent on the nucleic acid being in a duplex, or heUcal form. 

The term "duplex dependent protein binding sites or regions" as used herein refers 
to discrete regions or sequences within a nucleic acid that are bound with particular 
5 affinity by specific duplex-dependent nucleic acid binding proteins. This is in contrast to 
the generahzed duplex-dependent binding of proteins that are not site-specific, such as 
the histone proteins that bind chromatin with little reference to specific sequences or 
sites. 

The term "protein-binding region" as used herein refers to a nucleic acid region 
1 0 identified by a sequence or structure as binding to a particular protein or class of proteins. 
It is within the scope of this definition to include those regions that contain sufficient 
genetic information to allow identifications of the region by comparison to known 
sequences, but which might not have the requisite structure for actual binding {e.g., a 
single strand of a duplex-depending nucleic acid binding protein site). As used herein 
U 15 "protein binding region" excludes restriction endonuclease binding regions. 

The term "complete double stranded protein binding region" as used herein refers 
Jg to the minimum region of continuous duplex required to allow binding or other activity of 

n a duplex-dependent protein. This definition is intended to encompass the observation 

that some duplex dependent nucleic acid binding proteins can interact with full activity 
^ 9 20 with regions of duplex that may be shorter than a canonical protein binding region as 
observed in one or the other of the two single strands, hi other words, one or more 
nucleotides in the region may be allowed to remain unpaired without suppressing 
binding. As used here in, the term "complete double stranded binding region" refers to 
the minimum sequence that will accommodate the binding function. Because some such 
25 regions can tolerate non-duplex sequences in multiple places, although not necessarily 
simultaneously, a single protein binding region might have several shorter sub-regions 
that, when duplexed, will be fully competent for protein binding. 

The term "template" refers to a strand of nucleic acid on which a complementary 
copy is built from nucleoside triphosphates through the activity of a template-dependent 
30 nucleic acid polymerase. Within a duplex the template strand is, by convention, depicted 
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and described as the "bottom" strand. Similarly, the non-template strand is often depicted 
and described as the "top" strand. 

The term "template-dependent RNA polymerase" refers to a nucleic acid 
polymerase that creates new RNA strands through the copying of a template strand as 
5 described above and which does not synthesize RNA in the absence of a template. This 
is in contrast to the activity of the template-independent nucleic acid polymerases that 
synthesize or extend nucleic acids without reference to a template, such as terminal 
deoxynucleotidyl transferase, or Poly A polymerase. 

The term "ARRESTOR molecule" refers to an agent added to or included in an 
10 invasive cleavage reaction in order to stop one or more reaction components from 
participating in a subsequent action or reaction. This may be done by sequestering or 
inactivating some reaction component (e,g, , by binding or base-pairing a nucleic acid 
p component, or by binding to a protein component). The term "ARRESTOR 
1; oUgonucleotide" refers to an oUgonucleotide included in an invasive cleavage reaction in 

1 5 order to stop or arrest one or more aspects of any reaction {e.g., the first reaction and/or 
=l3 any subsequent reactions or actions; it is not intended that the ARRESTOR 
13 oUgonucleotide be limited to any particular reaction or reaction step). This may be done 

;j! by sequestering some reaction component (e.g., base-pairing to another nucleic acid, or 

f4 

B binding to a protein component). However, it is not intended that the term be so limited 

PI 

f| 20 as to just situations in which a reaction component is sequestered. 

As used herein, the term "kit" refers to any deUvery system for deUvering 
materials. In the context of reaction assays, such delivery systems include systems that 
allow for the storage, transport, or delivery of reaction reagents (e.g., oligonucleotides, 
enzymes, etc. in the appropriate containers) and/or supporting materials (e.g., buffers, 

25 written instructions for perfomiing the assay etc.) from one location to another. For 
example, kits include one or more enclosures (e.g., boxes) containing the relevant 
reaction reagents and/or supporting materials. As used herein, the term "fragmented kit" 
refers to a delivery systems comprising two or more separate containers that each contain 
a subportion of the total kit components. The containers may be delivered to the intended 

30 recipient together or separately. For example, a first container may contain an enzyme 
for use in an assay, while a second container contains oligonucleotides. The term 

54 



"fragmented kit" is intended to encompass kits containing Analyte specific reagents 
(ASR's) regulated xmder section 520(e) of the Federal Food, Drug, and Cosmetic Act, but 
are not limited thereto. Indeed, any deUvery system comprising two or more separate 
containers that each contains a subportion of the total kit components are included in the 
5 term "fragmented kit." In contrast, a "combined kit" refers to a delivery system 

containing all of the components of a reaction assay in a single container (e.g., in a single 
box housing each of the desired components). The term "kit" includes both fragmented 
and combined kits. 

As used herein, the term "functional domain" refers to a region, or a part of a 
10 region, of a protein (e.g., an enzyme) that provides one or more functional characteristic 
of the protein. For example, a functional domain of an enzyme may provide, directly or 
j^^J indirectly, one or more activities of the enzyme including, but not limited to, substrate 
0 binding capability and catalytic activity. A functional domain may be characterized 
J; through mutation of one or more amino acids within the functional domain, wherein 

Pf 15 mutation of the amino acid(s) alters the associated functionality (as measured empirically 

1/4 

yfl in an assay) thereby indicating the presence of a functional domain. 

As used herein, the term "heterologous functional domain" refers to a protein 

ff functional domain that is not in its natural environment. For example, a heterologous 

113 

fji functional domain includes a functional domain from one enzyme introduced into another 

^ 20 enzyme. A heterologous functional domain also includes a functional domain native to a 

ill 

protein that has been altered in some way (e.g., mutated, added in multiple copies, etc.). 
A heterologous functional domain may comprise a plurality of contiguous amino acids or 
may include two or more distal amino acids are amino acids fragments (e.g., two or more 
amino acids or fragments with intervening, non-heterologous, sequence). Heterologous 

25 functional domains are distinguished from endogenous functional domains in that the 
heterologous amino acid(s) are joined to or contain amino acid sequences that are not 
found naturally associated with the amino acid sequence in nature or are associated with a 
portion of a protein not found in nature. 

As used herein, the term "altered functionality in a nucleic acid cleavage assay" 

30 refers to a characteristic of an enzyme that has been altered in some manner to differ from 
its natural state (e.g., to differ from how it is found in nature). Alterations include, but 
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are not limited to, addition of a heterologous functional domain (e.g., through mutation or 
through creation of chimerical proteins). In some embodiments, the altered characteristic 
of the enzyme may be one that improves the performance of an enzyme in a nucleic acid 
cleavage assay. Types of improvement include, but are not limited to, improved nuclease 
activity (e.g., improved rate of reaction), improved substrate binding (e.g., increased or 
decreased binding of certain nucleic acid species [e.g., RNA or DNA] that produces a 
desired outcome [e.g., greater specificity, improved substrate turnover, etc.]), and 
improved background specificity (e.g., less undesired product is produced). The present 
invention is not limited by the nucleic cleavage assay used to test improved fimctionality. 
However, in some preferred embodiments of the present invention, an invasive cleavage 
assay is used as the nucleic acid cleavage assay. In certain particularly preferred 
embodiments, an invasive cleavage assay utilizing an RNA target is used as the nucleic 
acid cleavage assay. 

As used herein, the terms "N-terminal" and "C-terminal" in reference to 
polypeptide sequences refer to regions of polypeptides including portions of the N- 
terminal and C-terminal regions of the polypeptide, respectively. A sequence that 
includes a portion of the N-terminal region of polypeptide includes amino acids 
predominantly from the N-terminal half of the polypeptide chain, but is not limited to 
such sequences. For example, an N-terminal sequence may include an interior portion of 
the polypeptide sequence including bases from both the N-terminal and C-terminal halves 
of the polypeptide. The same applies to C-terminal regions. N-terminal and C-terminal 
regions may, but need not, include the amino acid defining the ultimate N-terminal and 
C-terminal ends of the polypeptide, respectively. 

DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a schematic representation of sequential invasive cleavage 
reactions. In step A, an upstream INVADER ohgonucleotide and a downstream probe 
combine with a target nucleic acid strand to form a cleavage structure. In step B, the 
portion of the cleaved signal probe from A combines with a second target nucleic acid 
strand and a labeled signal probe to form a second cleavage structure. In step C, cleavage 
of the labeled second cleavage structure yields a detectable signal. 
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Figure 2 shows schematic representations of several examples of invasive 
cleavage structures comprising RNA target strands (SEQ ID NO: 141). Panel A depicts 
an INVADER oligonucleotide (SEQ ED NO: 142) and probe (SEQ ID NO: 143). Panel B 
depicts an INVADER oligonucleotide (SEQ ID NO: 144) and probe (SEQ ID NO: 143). 
Panel C depicts an INVADER oligonucleotide (SEQ ID NO: 145) and probe (SEQ ED 
NO: 145). Panel D depicts an INVADER oligonucleotide (SEQ ID NO: 145) and probe 
(SEQ ID NO: 146). 

Figure 3 shows schematic representations of two examples of structures that are 
not invasive cleavage structures labelled SEQ ID NOs: 147-1 52. 

Figure 4 shows a schematic representation of a configuration of invasive cleavage 
that is useful for detection of target sequence variations. In A, an invasive cleavage 
structure having overlap between the two probes is formed, and the arrow indicates that it 
is cleavable by the enzymes of the present invention. In B, variation of the target 
sequence removes a region of complementarity to the downstream probe and eliminates 
the overlap. The absence of an arrow in panel B indicates a reduced rate of cleavage of 
this structure compared to that diagrammed in panel A. 

Figure 5 shows a diagram of the X-ray structure of a temary complex of Klentaql 
with primer/template DNA in the polymerizing mode determined by Li et al (Li et al. 
Protein Sci., 7:1 1 16 [1998]). Without intending to represent precise borders between 
features of the physical form, the portions referred to in the text as the "fingers", "thumb" 
and "pahn" regions are loosely indicated by the circle, rectangle, and oval, respectively. 

Figure 6 shows a schematic diagram of the DNA polymerase gene fi:om Thermus 
aquaticus. Restriction sites used in these studies are indicated above. The approximate 
regions encoding various structural or functional domains of the protein are indicated by 
double-headed arrows, below. 

Figure 7 shows a schematic diagram of the chimeric constructs comprising 
portions of the TaqPol gene aud the TthPol gene. Open and shaded boxes denote TaqPol 
and TthPol sequences, respectively. The numbers correspond to the amino acid sequence 
of TaqPol. The 5' nuclease and polymerase domains of TaqPol and the palm, thmnb, and 
fingers regions of the polymerase domain are indicated. The abbreviations for the 
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restrictions sites used for recombination are as follows: E, EcoRI; N, NotI; Bs, BstBI; D, 
Ndel; B, BamHI; and S, Sail. 

Figure 8A-H shows a comparison of the nucleotide structure of the polymerase 
genes isolated from Thermus aquaticus (SEQ ID NO: 153), Thermus flavus (SEQ ID 
NO: 154) and Thermus thermophilus (SEQ ID NO: 155); the consensus sequence (SEQ ID 
NO: 156) is shown at the top of each row. 

Figure 9A-C shows a comparison of the amino acid sequence of the polymerase 
isolated from Thermus aquaticus (SEQ ID NO:157), Thermus flavus (SEQ ID NO:158), 
and Thermus thermophilus (SEQ ID N0:1); the consensus sequence (SEQ ID NO: 159) is 
shown at the top of each row. 

Figure 10 shows the sequences and proposed structures of substrates for the 
invasive signal ampUfication reaction with human IL-6 RNA target strand (SEQ ID 
NO:160) and upstream probe (SEQ ID N0:161). The cleavage site of the downstream 
probe (SEQ ID NO: 162) is indicated by an arrow. Sequence of the IL-6 DNA target 
strand (SEQ ID NO: 163) is shown below. 

Figure 1 1 shows the image generated by a fluorescence imager showing the 
products of invasive cleavage assays using the indicated enzymes, and the IL-6 substrate 
of Figure 10 having either a DNA target strand (A) or an RNA target strand (B). 

Figure 12 compares the cycling cleavage activities of Taq DN RX HT, Tth DN 
RX HT, and Taq-Tth chimerical enzymes with IL-6 substrate having an RNA target 
strand. 

Figure 13 shows a comparison of the amino acid sequences of the Bstl-BamHI 
fragments of TaqPol (SEQ ID NO:164) and TthPol (SEQ ID NO:165). Pairs of similar 
amino acids are shaded with light gray. Aligned amino acids that have a charge 
difference are shaded with dark gray. The numbers correspond to the amino acid 
sequence of TaqPol. Amino acids of TaqPol changed to the corresponding amino acids 
of TthPol by site-directed mutagenesis are indicated by (+). 

Figure 14 compares the cycling cleavage activities of Taq DN RX HT, Taq-Tth 
chimerical enzymes, and chimerical enzymes having the indicated additional amino acid 
modifications, with IL-6 substrate having an RNA target strand. 
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Figure 15 compares the cycling cleavage activities of Taq DN RX HT, Tth DN 
RX HT, and Taq DN RX HT having the indicated amino acid modifications, with IL-6 
substrate having an RNA t£U*get strand. 

Figure 16 compares polymerization activities of TaqPol, TthPol, and Taq-Tth 
chimerical enzymes, and TaqPol having the indicated amino acid modifications. 

Figure 17 shows a diagram of the X-ray structure of a ternary complex of 
Klentaql with primer/template DNA in the polymerizing mode determined by Li et ah 
(Li et al. Protein Sci., 7:1 1 16 [1998]). Amino acids G418 and E507 are indicated. 

Figures 18 A-D show schematic diagrams of examples of substrates that may be 
used to measure various cleavage activities of enzymes. The substrates may be labeled, 
for example, with a fluorescent dye and a quenching moiety for FRET detection, as 
shown, to facilitate detection and measurement. The substrates of ISA and 18B are 
invasive cleavage structures having RNA and DNA target strands, respectively. 18C 
shows an example of an X-structure, and 18D shows an example of a hairpin structure, 
both of which may be used to assess the activity of enzymes on altemative structures that 
may be present in invasive cleavage reactions. 

Figure 19 shows schematic diagrams of chimeric constructs comprising portions 
of the TaqPol gene and the TthPol gene. Open and shaded boxes denote TaqPol and 
TthPol sequences, respectively. The chimeras also include the DN, RX, and HT 
modifications. A table compares the cleavage activity of each protein on the indicated 
cleavage substrates. 

Figure 20A shows a schematic diagram for an RNA containing invasive cleavage 
substrate. The 5' end of the target molecule (SEQ ID NO: 166) is modified with biotin 
and blocked with streptavidin as described. The downstream probe (SEQ ID NO: 167) 
with cleavage site is also shown. Panels B-D show analysis of the properties of the Taq 
DN RX HT G418K/E507Q mutant in cleavage of the shown substrate under conditions 
of varying reaction temperature, KCl concentration, and MgS04 concentration. 

Figure 21 shows schematic diagrams for model substrates used to test enzymes 
for invasive cleavage activity. The molecule shown in 21 A provides a DNA target strand 
(SEQ ID NO: 168), while the model shown in 21B provides an RNA containing target 
strand (SEQ ID NO:167). Both 21A and B show downstream probe SEQ ID NO:166. 
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Figure 22 shows schematic diagrams for model substrates used to test enzymes 
for cleavage activity on altemative, non-invasive structures. 

Figure 23 shows a schematic diagram for a model substrate used to test enzymes 
for invasive cleavage activity. 
5 Figure 24 shows schematic diagrams for a model substrate used to test enzymes 

for invasive cleavage activity on RNA or DNA target strands. 

Figure 25 compares the cycling cleavage activities of Tth DN RX HT, Taq 2M, 
TfiPol, Tsc Pol, and Tfi and Tsc-derived mutant enzymes. 

Figure 26 depicts structures that may be employed to determine the ablity of an 
10 enzyme to cleave a probe in the presence and the absence of an upstream oUgonucleotide. 
Fig. 26 displays the sequence of ohgonucleotide 89-15-1 (SEQ ID NO:212), 
oligonucleotide 81-69-5 (SEQ ID NO:213), ohgonucleotide 81-69-4 (SEQ ID NO:214), 
g oligonucleotide 81-69-3 (SEQ ID NO:215), oligonucleotide 81-69-2 (SEQ ID NO:216) 

5 and a portion of M13mpl8 (SEQ ID NO:217). 

!^ 15 Figure 27 shows the image generated by a fluorescence imager that shows the 

2 dependence of FEN- 1 on the presence of an overlapping upstream ohgonucleotide for 

specific cleavage of the probe, 
n Figure 28a shows the image generated by a fluorescence imager that compares the 

;4 amount of product generated in a standard (ie,, a non-sequential invasive cleavage 
Q 20 reaction) and a sequential invasive cleavage reaction. 

Figure 28b is a graph comparing the amount of product generated in a standard or 

basic (i.e,, a non-sequential invasive cleavage reaction) and a sequential invasive 

cleavage reaction ("INVADER sqrd") (y axis = fluorescence units; x axix = attomoles of 

target). 

25 Figure 29 shows the image generated by a fluorescence imager that shows that the 

products of a completed sequential invasive cleavage reaction cannot cross contaminant a 

subsequent similar reaction. 

Figure 30 shows the sequence of the oligonucleotide employed in an invasive 

cleavage reaction for the detection of HCMV viral DNA; Fig. 30 shows the sequence of 
30 oligonucleotide 89-76 (SEQ JD NO:218), ohgonucleotide 89-44 (SEQ ID NO:219) and 

nucleotides 3057-31 10 of the HCMV genome (SEQ JD NO:220). 
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Figure 3 1 shows the image generated by a fluorescence imager that shows the 
sensitive detection of HCMV viral DNA in samples containing human genomic DNA 
using an invasive cleavage reaction. 

Figure 32 is a schematic that illustrates one embodiment of the present invention, 
5 where the cut probe from an initial invasive cleavage reaction is employed as the 
INVADERoligonucleotide in a second invasive cleavage reaction, and where an 
ARRESTOR oligonucleotide prevents participation of remaining uncut first probe in the 
cleavage of the second probe. 

Figure 33 is a schematic that illustrates one embodiment of the present invention, 
10 where the cut probe from an initial invasive cleavage reaction is employed as an 

integrated INVADER-target complex in a second invasive cleavage reaction, and where 
an ARRESTOR ohgonucleotide prevents participation of remaining uncut first probe in 
the cleavage of the second probe. 

Figure 34 shows three images generated by a fluorescence imager showing that 
|4| 15 two different lengths of T 0-methyl, 3' terminal amine-modified ARRESTOR 

oligonucleotide both reduce non-specific background cleavage of the secondary probe 
when included in the second step of a reaction where the cut probe from an initial 
invasive cleavage reaction is employed as an integrated INVADER-target complex in a 
second invasive cleavage reaction. 
20 Figure 35A shows two images generated by a fluorescence imager showing the 

effects on nonspecific and specific cleavage signal of increasing concentrations of 
primary probe in the first step of a reaction where the cut probe from an initial invasive 
cleavage reaction is employed as the INVADER ohgonucleotide in a second invasive 
cleavage reaction. 

25 Figure 35B shows two images generated by a fluorescence imager showing the 

effects on nonspecific and specific cleavage signal of increasing concentrations of 
primary probe in the first step of a reaction, and inclusion of a T 0-methyl, 3' terminal 
amine-modified ARRESTOR ohgonucleotide in the second step of a reaction where the 
cut probe from an initial invasive cleavage reaction is employed as the INVADER 
30 oligonucleotide in a second invasive cleavage reaction. 
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Figure 35C shows shows a graph generated using the spreadsheet MICROSOFT 
EXCEL software, comparing the effects on nonspecific and specific cleavage signal of 
increasing concentrations of primary probe in the first step of a reaction, in the presence 
or absence of a 2* 0-methyl, 3' terminal amine-modified ARRESTOR oligonucleotide in 

the second step of a reaction where the cut probe from an initial invasive cleavage 
reaction is employed as the INVADER oligonucleotide in a second invasive cleavage 
reaction. 

Figure 36A shows two images generated by a fluorescence imager showing the 
effects on nonspecific and specific cleavage signal of including an unmodified 
ARRESTOR oligonucleotide in the second step of a reaction where the cut probe from an 
initial invasive cleavage reaction is employed as the INVADER oUgonucleotide in a 
second invasive cleavage reaction. 

Figure 36B shows two images generated by a fluorescence imager showing the 
effects on nonspecific and specific cleavage signal of including a 3* terminal amine 
modified ARRESTOR oUgonucleotide, a partially 2' 0-methyl substituted, 3' terminal 
amine modified ARRESTOR ohgonucleotide, or an entirely 2' 0-methyl, 3' terminal 
amine modified ARRESTOR ohgonucleotide in the second step of a reaction where the 
cut probe from an initial invasive cleavage reaction is employed as the INVADER 
oligonucleotide in a second invasive cleavage reaction. 

Figure 37A shows two images generated by a fluorescence imager comparing the 
effects on nonspecific and specific cleavage signal of including ARRESTOR 
oligonucleotides of different lengths in the second step of a reaction where the cut probe 
fi-om an initial invasive cleavage reaction is employed as the INVADER ohgonucleotide 
in a second invasive cleavage reaction. 

Figure 37B shows two images generated by a fluorescence imager comparing the 
effects on nonspecific and specific cleavage signal of including an arrestoer 
oligonucleotides of different lengths in the second step of a reaction where the cut probe 
fi"om an initial invasive cleavage reaction is employed as the INVADER oligonucleotide 
in a second invasive cleavage reaction, and in which a longer variant of the secondary 
probe used in the reactions in Fig. 37A is tested. 
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Figure 37C shows a schematic diagram of a primary probe aligned with several 
ARRESTOR oligonucleotides of different lengths. The region of the primary probe that 
is complementary to the HBV target sequence is underlined. The ARRESTOR 
oligonucleotides are aligned with the probe by complementarity. 

Figure 38 shows two images generated by a fluorescence imager comparing the 
effects on nonspecific and specific cleavage signal of including ARRESTOR 
oUgonucleotides of different lengths in the second step of a reaction where the cut probe 
from an initial invasive cleavage reaction is employed as the INVADER oligonucleotide 
in a second invasive cleavage reaction, using secondary probes of two different lengths. 

Figure 39 shows a graph of the calculated running average of a ten nucleotide 
stretch of the hUbiquitin RNA (the Ave(lO) Index) derived from the SS-count output of 
an mfold analysis, expressed as a percentage of the total number of structures found by 
mfold that include a particular base, plotted against the position of the base. 

Figure 40 shows an example microplate layout for an RNA INVADER assay 
comprising 40 samples, 6 standards, and a No Target Control. 

Figure 41 shows INVADER assay components for use in detecting human (h), 
mouse (m), or rat (r) RNAs of the indicated genes or transcripts. 

Figure 42 shows a computer display of an INVADERCREATOR Order Entry 

screen. 

Figure 43 shows a computer display of an INVADERCREATOR Multiple SNP 
Design Selection screen. 

Figure 44 shows a computer display of an INVADERCREATOR Designer 
Worksheet screen. 

Figure 45 shows a computer display of an INVADERCREATOR Output Page 

screen. 

Figure 46 shows a computer display of an INVADERCREATOR Printer Ready 
Output screen. 

Figure 47 shows INVADER assay components (SEQ ID NOs:709-2640) for use 
in detecting RNA target nucleic acids. These assay components can also be used in the 
detection of cDNA or DNA copies of the RNA targets. Components are grouped per 
RNA analyte to be detected. Where multiple probes, INVADER oHgonucleotides, 
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stacker oligonucleotides, ARRESTOR oligonucleotides, or other components are 
provided, any of the multiple components maybe used, unless indicated otherwise. 
Unless indicated otherwise, oUgonucleotides are presented 5 '-3' orientation. 

Figure 48 shows a chart showing the Ave(lO) Index against base pair position. 

Figure 49 shows INVADER assay components (SEQ ID NOs:2868-4004) for use 
in detecting RNA target nucleic acids. These assay components can also be used in the 
detection of cDNA or DNA copies of the RNA targets. Components are grouped per 
RNA analyte to be detected. Where multiple probes, INVADER oligonucleotides, 
stacker oligonucleotides, ARRESTOR oligonucleotides, or other components are 
provided, any of the multiple components may be used, unless indicated otherwise. 
Unless indicated otherwise, oligonucleotides are presented 5 '-3' orientation. All primary 
probes {Le,, oligonucleotides indicated as "Probes" that are not part of a Secondary 
System) include a 3' amine (this may or may not be indicated), unless another chemistry 
is specifically identified. Natural bases are designated with upper case characters, and 2' 
0-methylated bases are designated with lower case characters. 

Figure 50 A and B show schematic diagrams of two INVADER assay 
embodiments. 

DESCRIPTION OF THE INVENTION 
Introduction 

The present invention relates to methods and compositions for treating nucleic 
acid, and in particular, methods and compositions for detection and characterization of 
nucleic acid sequences and sequence changes. 

In preferred embodiments, the present invention relates to means for cleaving a 
nucleic acid cleavage structure in a site-specific manner. While the present invention 
provides a variety of cleavage agents, in some embodiments, the present invention relates 
to a cleaving enzyme having 5' nuclease activity without interfering nucleic acid 
synthetic abihty. In other embodiments, the present invention provides novel 
polymerases (e.g., thermostable polymerases) possessing altered polymerase and/or 
nucleases activities. 
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For example, in some embodiments, the present invention provides 5' nucleases 
derived from thermostable DNA polymerases that exhibit altered DNA synthetic activity 
from that of native thermostable DNA polymerases. The 5' nuclease activity of the 
polymerase is retained while the synthetic activity is reduced or absent. Such 5' 
nucleases are capable of catalyzing the structure-specific cleavage of nucleic acids in the 
absence of interfering synthetic activity. The lack of synthetic activity during a cleavage 
reaction results in nucleic acid cleavage products of uniform size. 

The novel properties of the nucleases of the invention form the basis of a method 
of detecting specific nucleic acid sequences. This method relies upon the amplification 
of the detection molecule rather than upon the ampUfication of the target sequence itself 
as do existing methods of detecting specific target sequences. 

DNA polymerases (DNAPs), such as those isolated from E, coli or from 
thermophilic bacteria of the genus Thermus as well as other organisms, are enzymes that 
synthesize new DNA strands. Several of the known DNAPs contain associated nuclease 
activities in addition to the synthetic activity of the enzyme. 

Some DNAPs are known to remove nucleotides from the 5' and 3' ends of DNA 
chains (Romberg, DNA Replication, W.H, Freeman and Co., San Francisco, pp. 127-139 
[1980]). These nuclease activities are usually referred to as 5' exonuclease and 3' 
exonuclease activities, respectively. For example, the 5' exonuclease activity located in 
the N-terminal domain of several DNAPs participates in the removal of RNA primers 
during lagging strand synthesis during DNA replication and the removal of damaged 
nucleotides during repair. Some DNAPs, such as the E, coli DNA polymerase 
(DNAPEcl), also have a 3* exonuclease activity responsible for proof-reading during 
DNA synthesis (Romberg, supra). 

A DNAP isolated from Thermus aqiiaticus, termed Taq DNA polymerase 
(DNAPTaq), has a 5' exonuclease activity, but lacks a frmctional 3' exonucleolytic 
domain (Tindall and Kunkell, Biochem., 27:6008 [1988]). Derivatives of DNAPEcl and 
DNAPTaq, respectively called the Klenow and Stoffel fragments, lack 5* exonuclease 
domains as a result of enzymatic or genetic manipulations (Bmtlag et al., Biochem, 
Biophys. Res. Commun., 37:982 [1969]; Erlich et al.. Science 252:1643 [1991]; Setlow 
and Romberg, J. Biol. Chem., 247:232 [1972]). 



65 



The 5' exonuclease activity of DNAPTaq was reported to require concurrent 
synthesis (Gelfand, PGR Technology - Principles and Applications for DNA 
AmpHfication, H.A. Erlich, [Ed.], Stockton Press, New York, p. 19 [1989]). Although 
mononucleotides predominate among the digestion products of the 5' exonucleases of 
5 DNAPTaq and DNAPEcl, short oligonucleotides (< 12 nucleotides) can also be observed 
implying that these so-called 5' exonucleases can function endonucleolytically (Setlow, 
supra; Holland et al., Proc. Natl. Acad. Sci. USA 88:7276 [1991]). 

In WO 92/06200, Gelfand et al. show that the preferred substrate of the 5* 
exonuclease activity of the thermostable DNA polymerases is displaced single-stranded 
10 DNA. Hydrolysis of the phosphodiester bond occurs between the displaced single- 
stranded DNA and the double-helical DNA with the preferred exonuclease cleavage site 
p:. being a phosphodiester bond in the double helical region. Thus, the 5' exonuclease 
Q activity usually associated with DNAPs is a structure-dependent single-stranded 

J* endonuclease and is more properly referred to as a 5* nuclease. Exonucleases are 

1% 

7Z 1 5 enzymes that cleave nucleotide molecules from the ends of the nucleic acid molecule. 

Endonucleases, on the other hand, are enzymes that cleave the nucleic acid molecule at 
p internal rather than terminal sites. The nuclease activity associated with some 

!,J thermostable DNA polymerases cleaves endonucleolytically but this cleavage requires 

9 contact with the 5' end of the molecule being cleaved. Therefore, these nucleases are 
m 20 referred to as 5' nucleases. 

When a 5* nuclease activity is associated with a eubacterial Type A DNA 
polymerase, it is found in the one third N-terminal region of the protein as an independent 
functional domain. The C-terminal two-thirds of the molecule constitute the 
polymerization domain that is responsible for the synthesis of DNA. Some Type A DNA 
25 polymerases also have a 3* exonuclease activity associated with the two-third C-terminal 
region of the molecule. 

The 5' exonuclease activity and the polymerization activity of DNAPs can be 
separated by proteolytic cleavage or genetic manipulation of the polymerase molecule. 
The Klenow or large proteolytic cleavage fragment of DNAPEcl contains the 
30 polymerase and 3' exonuclease activity but lacks the 5' nuclease activity. The Stoffel 
fragment of DNAPTaq (DNAPStf) lacks the 5' nuclease activity due to a genetic 
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manipulation that deleted the N-tenninal 289 amino acids of the polymerase molecule 
(Erlich et al, Science 252:1643 [1991]). WO 92/06200 describes a thermostable DNAP 
with an altered level of 5' to 3* exonuclease. U.S. Patent No. 5,108,892 describes a 
Thermus aquaticus DNAP without a 5' to 3' exonuclease. Thermostable DNA 
5 polymerases with lessened amounts of synthetic activity are available (Third Wave 
Technologies, Madison, WI) and are described in U.S. Pat. Nos. 5,541,311, 5,614,402, 
5,795,763, 5,691,142, and 5,837,450, herein incorporated by reference in their entireties. 
The present invention provides 5' nucleases derived from thermostable Type A DNA 
polymerases that retain 5' nuclease activity but have reduced or absent synthetic activity. 
10 The ability to uncouple the synthetic activity of the enzyme from the 5' nuclease activity 
proves that the 5' nuclease activity does not require concurrent DNA synthesis as was 
Pi previously reported (Gelfand, PGR Technology, supra). 

y In addition to the 5 '-exonuclease domains of the DNA polymerase I proteins of 

Eubacteria, described above, 5' nucleases have been found associated with bacteriophage, 
15 eukaryotes and archaebacteria. Overall, all of the enzymes in this family display very 
similar substrate specificities, despite their limited level of sequence similarity. 

□ Consequently, enzymes suitable for use in the methods of the present invention may be 
isolated or derived from a wide array of sources. 

W A mammahan enzyme with ftmctional similarity to the 5 -exonuclease domain of 

1^1 20 E. coH Pol I was isolated nearly 30 years ago (Lindahl, et al, Proc Natl Acad Sci U S A 
62(2): 597-603 [1969]). Later, additional members of this group of enzymes called flap 
endonucleases (FENl) from Eukarya and Archaea were shown to possess a nearly 
identical structure specific activity (Harrington and Lieber. Embo J 13(5), 1235-46 
[1994]; Murante et al, J Biol Chem 269(2), 1 191-6 [1994]; Robins, et al, J Biol Chem 
25 269(46), 28535-8 [1994]; Hosfield, et al, J Biol Chem 273(42), 27154-61 [1998]), 

despite limited sequence similarity. The substrate specificities of the FENl enzymes, and 
the eubacterial and related bacteriophage enzymes have been examined and found to be 
similar for all enzymes (Lyamichev, et al. Science 260(5109), 778-83 [1993], Harrington 
and Lieber, supra^ Murante, et al, supra, Hosfield, et al supra, Rao, et al, J Bacteriol 
30 180(20), 5406-12 [1998], Bhagwat, et al,. J. Biol Chem 272(45), 28523-30 [1997], 
Garforth and Sayers, Nucleic Acids Res 25(19), 3801-7 [1997]). 
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Using preformed substrates, many of the studies cited above determined that these 
nucleases leave a gap upon cleavage, leading the authors to speculate that DNA 
polymerase must then act to fill in that gap to generate a Ugatable nick. A number of 
other 5' nucleases have been shown to leave a gap or overlap after cleavage of the same 
5 or similar flap substrates. It has since been determined that that all the structure-specific 
5'-exonucleases leave a nick after cleavage if the substrate has an overlap between the 
upstream and downstream duplexes (Kaiser et al, J. Biol .Chem. 274(30):21387-21394 
[1999]). While duplexes having several bases of overlapping sequence can assume 
several different conformations through branch migration, it was determined that 
10 cleavage occurs in the conformation where the last nucleotide at the 3' end of the 

upstream strand is unpaired, with the cleavage rate being essentially the same whether the 
end of the upstream primer is A, G, or T. It was determined to be positional overlap 
O between the 3' end of the upstream primer and downstream duplex, rather then sequence 
'% overlap, that provides optimal cleavage. In addition to allowing these enzymes to leave a 

1 5 nick after cleavage, the single base of overlap causes the enzymes to cleave several 
jfl orders of magnitude faster than when a substrate lacks overlap (Kaiser et al, supra). 

Any of the 5' nucleases described below may find appUcation in one or more 
embodiments of the methods described herein. 5' nucleases of particular utiUty in the 

5 "if 

p methods of present invention include but are not limited to polymerases from a Thermus 

pi 20 species including, but not limited to, Thermus aguaticus, Thermus flavus, Thermus 

thermophilus, Thermus filiformus, and Thermus scotoductus, and altered polymerases. 
Particularly preferred are altered polymerases exhibiting improved performance in 
detection assays based on the cleavage of a DNA member of an invasive cleavage 
structure that comprises an RNA target strand. 
25 Chimerical polymerases may find application in one or more embodiments of the 

present invention, including but not limited to chimerical polymerases comprising one or 
more portions of one or more FEN nucleases including but are not limited to those of 
Methanococcus jannaschii, Methanobacterium thermoautotrophicum, Archaeoglobus 
veneficus, Sulfolobus solfataricus, Pyrobaculum aerophilum, Thermococcus litoralis, 
30 Archaeaglobus profundus, Acidianus brierlyi Acidianus ambivalens, Desulfurococcus 
amylolyticus, Desulfurococcus mobilis, Pyrodictium brockii, Thermococcus gorgonarius, 
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Thermococcus zilligii, Methanopyrus kandleri, Methanococcus igneus, Pyrococcus 
horikoshii, andAeropyrum pernix; particularly preferred FENl enzymes are chimerical 
Archaeoglobus fulgidus and Pyrococcus furiosus. Particularly preferred are altered 
polymerases exhibiting improved performance in detection assays based on the cleavage 
of a DNA member of an invasive cleavage structure that comprises an RNA target strand. 

The detailed description of the invention is presented in the following sections: 

L Detection of Specific Nucleic Acid Sequences Using 5' Nucleases in an 

INVADER Directed Cleavage Assay; 
IL Signal Enhancement By Incorporating The Products Of An Invasive Cleavage 

Reaction Into A Subsequent Invasive Cleavage Reaction; 

III. Effect of ARRESTOR Oligonucleotides on Signal and Background in Sequential 
Invasive Cleavage Reactions. 

IV. Improved Enzymes For Use In INVADER OHgonucleotide-Directed Cleavage 
Reactions Comprising RNA Targets; 

V. Reaction Design for INVADER Assay Detection of RNA Targets; 
VL Kits for performing the RNA INVADER Assay; and 

Vn. The INVADER Assay for Direct Detection and Measurement of Specific RNA 
Analytes. 

I. Detection of Specific Nucleic Acid Sequences Using 5' Nucleases in an 
INVADER Directed Cleavage Assay 

1 . INVADER Assay Reaction design 

The present invention provides means for forming a nucleic acid cleavage 
structure that is dependent upon the presence of a target nucleic acid and cleaving the 
nucleic acid cleavage structure so as to release distinctive cleavage products. 5' nuclease 
activity, for example, is used to cleave the target-dependent cleavage structure and the 
resulting cleavage products are indicative of the presence of specific target nucleic acid 
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sequences in the sample. When two strands of nucleic acid, or oligonucleotides, both 
hybridize to a target nucleic acid strand such that they form an overlapping invasive 
cleavage structure, as described below, invasive cleavage can occur. Through the 
interaction of a cleavage agent (e.g., a 5' nuclease) and the upstream oligonucleotide, the 
5 cleavage agent can be made to cleave the downstream oligonucleotide at an intemal site 
in such a way that a distinctive fragment is produced. Such embodiments have been 
termed the INVADER assay (Third Wave Technologies) and are described in U.S. Patent 
Nos. 5,846,717, 5,985,557, 5,994,069, 6,001,567, 6,090,543, and 6,348,314, and PCT 
Publications WO 97/27214 and WO 98/42873, herein incorporated by reference in their 
10 entireties. 

. The present invention further provides assays in which the target nucleic acid is 
O reused or recycled during multiple rounds of hybridization with oligonucleotide probes 
3i and cleavage of the probes without the need to use temperature cycling (i.e., for periodic 
J' denaturation of target nucleic acid strands) or nucleic acid synthesis (i.e., for the 
m 1 5 polymerization-based displacement of target or probe nucleic acid strands). When a 
'!/ cleavage reaction is run under conditions in which the probes are continuously replaced 
9 on the target strand {e.g. through probe-probe displacement or through an equilibrium 
f j between probe/target association and disassociation, or through a combination 
S comprising these mechanisms, [ Reynaldo et al, J. Mol. Biol. 97: 511 (2000)]) multiple 
20 probes can hybridize to the same target, allowing multiple cleavages, and the generation 
of multiple cleavage products. 

By the extent of its complementarity to a target nucleic acid strand, an 
oUgonucleotide may be said to define a specific region of the target. In an invasive 
cleavage structure, the two oHgonucleotides define and hybridize to regions of the target 
25 that are adjacent to one another (i.e,, regions without any additional region of the target 
between them). Either or both oligonucleotides may comprise additional portions that are 
not complementary to the target strand. In addition to hybridizing adjacently, in order to 
form an invasive cleavage structure, the 3* end of the upstream oligonucleotide must 
comprise an additional moiety. When both oUgonucleotides are hybridized to a target 
30 strand to form a structure and such a 3' moiety is present on the upstream oUgonucleotide 
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within the structure, the oligonucleotides may be said to overlap, and the structure may be 
described as an overlapping, or invasive cleavage structure. 

In one embodiment, the 3' moiety of the invasive cleavage structure is a single 
nucleotide. In this embodiment the 3' moiety may be any nucleotide (Le,, it may be, but 
5 it need not be complementary to the target strand). In a preferred embodiment the 3' 
moiety is a single nucleotide that is not complementary to the target strand. In another 
embodiment, the 3' moiety is a nucleotide-like compound (z.e., a moiety having chemical 
features similar to a nucleotide, such as a nucleotide analog or an organic ring compound; 
See e.g., U.S. Pat. No. 5,985,557). In yet another embodiment the 3' moiety is one or 
10 more nucleotides that duplicate in sequence one or more nucleotides present at the 5' end 
of the hybridized region of the downstream oligonucleotide. In a further embodiment, 
the dupHcated sequence of nucleotides of the 3' moiety is followed by a single nucleotide 
that is not further dupUcative of the downstream ohgonucleotide sequence, and that may 
be any other nucleotide. In yet another embodiment, the dupUcated sequence of 
15 nucleotides of the 3* moiety is followed by a nucleotide-Uke compound, as described 
above. 

The downstream oligonucleotide may have, but need not have, additional moieties 
1% attached to either end of the region that hybridizes to the target nucleic acid strand. In a 

preferred embodiment, the downstream ohgonucleotide comprises a moiety at its 5' end 
^ 20 (z.e., a 5' moiety). In a particularly preferred embodiment, said 5' moiety is a 5' flap or 
arm comprising a sequence of nucleotides that is not complementary to the target nucleic 
acid strand. 

When an overlapping cleavage structure is formed, it can be recognized and 
cleaved by a nuclease that is specific for this structure {i.e. , a nuclease that will cleave 

25 one or more of the nucleic acids in the overlapping structure based on recognition of this 
structure, rather than on recognition of a nucleotide sequence of any of the nucleic acids 
forming the structure). Such a nuclease may be termed a "structure-specific nuclease". 
In some embodiments, the structure-specific nuclease is a 5' nuclease. In a preferred 
embodiment, the structure-specific nuclease is the 5' nuclease of a DNA polymerase. In 

30 another preferred embodiment, the DNA polymerase having the 5' nuclease is synthesis- 
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deficient. In another preferred embodiment, the 5' nuclease is a FEN-1 endonuclease. In 
a particularly preferred embodiment, the 5' nuclease is thermostable. 

In some embodiments, said structure-specific nuclease preferentially cleaves the 
downstream oligonucleotide. In a preferred embodiment, the downstream 
5 oligonucleotide is cleaved one nucleotide into the 5' end of the region that is hybridized 
to the target within the overlapping structure. Cleavage of the overlapping structure at 
any location by a structure-specific nuclease produces one or more released portions or 
firagments of nucleic acid, termed "cleavage products." 

In some embodiments, cleavage of an overlapping structure is performed under 
10 conditions wherein one or more of the nucleic acids in the structure can disassociate (ie, 
, un-hybridize, or melt) from the structure. In one embodiment, fiill or partial 

O disassociation of a first cleavage structure allows the target nucleic acid to participate in 
CI the formation of one or more additional overlapping cleavage structures. In a preferred 
S embodiment, the first cleavage structure is partially disassociated. In a particularly 
W 15 preferred embodiment only the oUgonucleotide that is cleaved disassociates from the first 
J, cleavage structure, such that it may be replaced by another copy of the same 

H oligonucleotide. In some embodiments, said disassociation is induced by an increase in 
ft temperature, such that one or more oligonucleotides can no longer hybridize to the target 
p strand. In other embodiments, said disassociation occurs because cleavage of an 
' ^ 20 ohgonucleotide produces only cleavage products that cannot bind to the target strand 

under the conditions of the reaction. In a preferred embodiment, conditions are selected 
wherein an oligonucleotide may associate with (i,e., hybridize to) and disassociate from a 
target strand regardless of cleavage, and wherein the ohgonucleotide may be cleaved 
when it is hybridized to the target as part of an overlapping cleavage structure. In a 
25 particularly preferred embodiment, conditions are selected such that the number of copies 
of the oligonucleotide that can be cleaved when part of an overlapping structure exceeds 
the number of copies of the target nucleic acid strand by a sufficient amount that when 
the first cleavage structure disassociates, the probability that the target strand will 
associate with an intact copy of the oligonucleotide is greater tiian the probability that 
30 that it will associate with a cleaved copy of the oligonucleotide. 
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In some embodiments, cleavage is performed by a structure-specific nuclease that 
can recognize and cleave structures that do not have an overlap. In a preferred 
embodiment, cleavage is performed by a structure-specific nuclease having a lower rate 
of cleavage of nucleic acid structures that do not comprise an overlap, comp^ed to the 
5 rate of cleavage of structures comprising an overlap. In a particularly preferred 

embodiment, cleavage is performed by a structure-specific nuclease having less than 1% 
of the rate of cleavage of nucleic acid structures that do not comprise an overlap, 
compared to the rate of cleavage of structures comprising an overlap. 

In some embodiments it is desirable to detect the cleavage of the overlapping 
10 cleavage structure. Detection may be by analysis of cleavage products or by analysis of 
^ one or more of the remaining uncleaved nucleic acids. For convenience, the following 
n discussion will refer to the analysis of cleavage products, but it will be appreciated by 
't2 those skilled in the art that these methods may as easily be appUed to analysis of the 
ffl? uncleaved nucleic acids in an invasive cleavage reaction. Any method known in the art 
^ 15 for analysis of nucleic acids, nucleic acid fragments or oligonucleotides may be applied 
to the detection of cleavage products. 

1^ In one embodiment, the cleavage products may be identified by chemical content, 

if 

If I e.g., the relative amounts of each atom, each particular type of reactive group or each 

2 nucleotide base (Chargaff et al., J. Biol. Chem, 111: 405 [1949]) they contain. In this 
20 way, a cleavage product may be distinguished from a longer nucleic acid fi:om which it 
was released by cleavage, or fi*om other nucleic acids. 

In another embodiment, the cleavage products may be distinguished by a 
particular physical attribute, including but not limited to length, mass, charge, or charge- 
to-mass ratio. In yet another embodiment, the cleavage product may be distinguished by 
25 a behavior that is related to a physical attribute, including but not limited to rate of 

rotation in solution, rate of migration during electrophoresis, coefficient of sedimentation 
in centrifiigation, time of flight in MALDI-TOF mass spectrometry, migration rate or 
other behavior in chromatography, melting temperature firom a complementary nucleic 
acid, or precipitability from solution. 
30 Detection of the cleavage products may be through release of a label. Such labels 

32 

may include, but are not limited to one or more of any of dyes, radiolabels such as P or 
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■^^S, binding moieties such as biotin, mass tags, such as metal ions or chemical groups, 
charge tags, such as polyamines or charged dyes, haptens such as digoxgenin, 
luminogenic, phosphorescent or fluorogenic moieties, and fluorescent dyes, either alone 
or in combination with moieties that can suppress or shift emission spectra, such as by 
5 fluorescence resonance energy transfer (FRET) or coUisional fluorescence energy 
transfer. 

In some embodiments, analysis of cleavage products may include physical 
resolution or separation, for example by electrophoresis, hybridization or by selective 
binding to a support, or by mass spectrometry methods such as MALDI-TOF. In other 
1 0 embodiments, the analysis may be performed without any physical resolution or 

separation, such as by detection of cleavage-induced changes in fluorescence as in FRET- 
based analysis, or by cleavage-induced changes in the rotation rate of a nucleic acid in 
solution as in fluorescence polarization analysis. 

Cleavage products cm be used subsequently in any reaction or read-out method 
y 15 that can make use of oligonucleotides. Such reactions include, but are not limited to, 

modification reactions, such as Ugation, tailing with a template-independent nucleic acid 

M 

O polymerase and primer extension with a template-dependent nucleic acid polymerase, 
j'l The modification of the cleavage products may be for purposes including, but not limited 
j£: to, addition of one or more labels or binding moieties, alteration of mass, addition of 
f j 20 specific sequences, or for any other purpose that would facilitate analysis of either the 

cleavage products or analysis of any other by-product, result or consequence of the 

cleavage reaction. 

Analysis of the cleavage products may involve subsequent steps or reactions that 
do not modify the cleavage products themselves. For example, cleavage products may be 

25 used to complete a functional structure, such as a competent promoter for in vitro 

transcription or another protein binding site. Analysis may include the step of using the 
completed structure for or to perform its function. One or more cleavage products may 
also be used to complete an overlapping cleavage structure, thereby enabling a 
subsequent cleavage reaction, the products of which may be detected or used by any of 

30 the methods described herein, including the participation in further cleavage reactions. 
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Certain preferred embodiments of the invasive cleavage reactions are provided in 
the following descriptions. In some embodiments, the methods of the present invention 
employ at least a pair of oligonucleotides that interact with a target nucleic acid to form a 
cleavage structure for a structure-specific nuclease. In some embodiments, the cleavage 
5 structure comprises i) a target nucleic acid that may be either single-stranded or double- 
stranded (when a double-stranded target nucleic acid is employed, it may be rendered 
single stranded, e,g„ by heating); ii) a first oUgonucleotide, termed the "probe," that 
defines a first region of the target nucleic acid sequence by being the complement of that 
region; iii) a second oUgonucleotide, termed the "INVADER ohgonucleotide," the 5' part 
10 of which defines a second region of the same target nucleic acid sequence, adjacent to 
U and downstream of the first target region, and the second part of which overlaps into the 
y region defined by the first oligonucleotide. 

Ci It can be considered that the binding of these oligonucleotides in this embodiment 

rl divides the target nucleic acid into three distinct regions: one region that has 
W 1 5 complementarity to only the probe; one region that has complementarity only to the 
M INVADER oUgonucleotide; and one region that has complementarity to both 

m oUgonucleotides. As discussed above, in some preferred embodiments of the present 
[| invention, the overlap may comprise moieties other than overlapping complementary 
ri bases. Thus, in some embodiments, there is a physical, but not sequence, overlap 
20 between the INVADER and probe oUgonucleotides, i.e., in these latter embodiments, 

there is not a region of the target nucleic acid that has complementarity to both 

oUgonucleotides. 

a) Oligonucleotide design 

Design of these oUgonucleotides (i.e., the INVADER oUgonucleotide and the 
25 probe) is accomplished using practices that are standard in the art. For example, 

sequences that have self-complementarity, such that the resulting oligonucleotides would 
either fold upon themselves, or hybridize to each other at the expense of binding to the 
target nucleic acid, are generally avoided. 

One consideration in choosing a length for these oUgonucleotides is the 
30 complexity of the sample containing the target nucleic acid. For example, the human 
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genome is approximately 3x10^ basepairs in length. Any 10-nucleotide sequence will 
appear with a frequency of 1:4^°, or 1:1,048,576 in a random string of nucleotides, which 
would be approximately 2,861 times in 3 billion basepairs. Clearly, an oligonucleotide of 
this length would have a poor chance of binding uniquely to a 10-nucleotide region 
5 within a target having a sequence the size of the human genome. If the target sequence 
were within a 3 kb plasmid, however, such an oligonucleotide might have a very 
reasonable chance of binding uniquely. By this same calculation it can be seen that an 
oligonucleotide of 16 nucleotides (i.e., a 16-mer) is the minimum length of a sequence 
that is mathematically likely to appear once in 3 x 10^ basepairs. This level of specificity 
1 0 may also be provided by two or more shorter oligonucleotides if liiey are configured to 
t.5 bind in a cooperative fashion (i.e., such that they can produce the intended complex only 

0 if both or all are bound to their intended target sequences), wherein the combination of 

Q 

n the short oligonucleotides provides the desired specificity, hi one such embodiment, the 
15 cooperativity between the shorter oligonucleotides is by a coaxial stacking effect that can 
W 1 5 occur when the oUgomicleotides hybridize to adjacent sites on a target nucleic acid. In 
r another embodiment, the shorter oUgonucleotides ^e connected to one another, either 
y directly, or by one or more spacer regions. The short oligonucleotides thus connected 
ni may bind to distal regions of the target and may be used to bridge across regions of 
Q secondary structure in a target. Examples of such bridging oUgonucleotides are described 
20 in PCX Pubhcation WO 98/50403, herein incorporated by reference in its entirety. 

A second consideration in choosing oligonucleotide length is the temperature 
range in which the oUgonucleotides wiU be expected to function. A 16-mer of average 
base content (50% G-C bases) wiU have a calculated Tm of about 4rc, depending on, 
among other things, the concentration of the oligonucleotide and its target, the salt 
25 content of the reaction and the precise order of the nucleotides. As a practical matter, 
longer oUgonucleotides are usually chosen to enhance the specificity of hybridization. 
Oligonucleotides 20 to 25 nucleotides in length are often used, as they are highly likely to 
be specific if used in reactions conducted at temperatures that are near their TmS (within 
about 5 ""C of the Tm). In addition, with calculated T^s in the range of 50 to TO^'C, such 
30 oUgonucleotides (i.e., 20 to 25-mers) are appropriately used in reactions catalyzed by 
thermostable enzymes, which often display optimal activity near this temperature range. 
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The maximum length of the oligonucleotide chosen is also based on the desired 
specificity. One should avoid choosing sequences that are so long that they are either at a 
high risk of binding stably to partial complements, or that they cannot easily be dislodged 
when desired (e.g., failure to disassociate firom the target once cleavage has occurred or 
5 failure to disassociate at a reaction temperature suitable for the enzymes and other 
materials in the reaction). 

The first step of design and selection of the ohgonucleotides for the INVADER 
oligonucleotide-directed cleavage is in accordance with these sample general principles. 
Considered as sequence-specific probes individually, each ohgonucleotide may be 
10 selected according to the guidelines Usted above. That is to say, each ohgonucleotide 
K will generally be long enough to be reasonably expected to hybridize only to the intended 
M target sequence within a complex sample, usually in the 20 to 40 nucleotide range. 

Altematively, because the INVADER oligonucleotide-directed cleavage assay depends 

if I 

I a upon the concerted action of these oligonucleotides, the composite length of the 2 

'fl 1 5 oligonucleotides which span/bind to the target may be selected to fall within this range, 

0 with each of the individual ohgonucleotides being in approximately the 13 to 17 

[.2 nucleotide range. Such a design might be employed if a non-thermostable cleavage 

1 % 

^ means were employed in the reaction, requiring the reactions to be conducted at a lower 

'<'^ 

1^ temperature than that used when thermostable cleavage means are employed. In some 
20 embodiments, it may be desirable to have these ohgonucleotides bind multiple times 
withm a single target nucleic acid (e.g., to bind to multiple variants or multiple similar 
sequences within a target). It is not intended that the method of the present invention be 
limited to any particular size of the probe or INVADER ohgonucleotide. 

The second step of designing an ohgonucleotide pair for this assay is to choose 
25 the degree to which the upstream "INVADER" ohgonucleotide sequence will overlap 
into the downstream "probe" ohgonucleotide sequence, and consequently, the sizes into 
which the probe will be cleaved. In some preferred embodiments, the probe 
ohgonucleotide can be made to "turn over," that is to say probe can be made to depart to 
allow the binding and cleavage of other copies of the probe molecule, without the 
30 requirements of thermal denaturation or displacement by polymerization. While in one 
embodiment of this assay probe tumover may be facilitated by an exonucleolytic 
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digestion by the cleavage agent, in some preferred embodiments of the present invention 
tumover does not require this exonucleolytic activity. For example, in some 
embodiments, a reaction temperature and reaction conditions are selected so as to create 
an equilibrium wherein the probe hybridizes and disassociates from the target. In other 
5 embodiments, temperature and reaction conditions are selected so that unbound probe can 
initiate binding to the target strand and physically displace bound probe, hi still other 
embodiments, temperature and reaction conditions are selected such that either or both 
mechanisms of probe replacement may occur in any proportion. The method of the 
^ :^ present invention is not limited to any particular mechanism of probe replacement. By 
- 1 0 any mechanism, when the probe is bound to the target to form a cleavage structure, 

cleavage can occur. The continuous cycling of the probe on and off of the target allows 
O multiple probes to bind and be cleaved for each copy of a target nucleic acid. 

% i) Non-sequence Overlaps 

P It has been determined that the relationship between the 3' end of the upstream 

W 

# 15 oligonucleotide and the desired site of cleavage on the probe should be carefully 
iU designed. It is known that the preferred site of cleavage for the types of structure-specific 
; J endonucleases employed herein is one basepair into a duplex (Lyamichev et al, supra), 
® It was previously beheved that the presence of an upstream oligonucleotide or pnmer 
m allowed the cleavage site to be shifted away from this preferred site, into the single 

20 stranded region of the 5' arm (Lyamichev et al, supra and U.S. Patent No. 5,422,253). In 
contrast to this previously proposed mechanism, and while not limiting the present 
invention to any particular mechanism, it is believed that the nucleotide immediately 5', 
or upstream of the cleavage site on the probe (including miniprobe and mid-range probes) 
should be able to basepair with the target for efficient cleavage to occur. In the case of 
25 the present invention, this would be the nucleotide in the probe sequence immediately 
upstream of the intended cleavage site. In addition, as described herein, it has been 
observed that in order to direct cleavage to that same site in the probe, the upstream 
oUgonucleotide should have its 3' base (z.e., nt) immediately upstream of the intended 
cleavage site of the probe. In embodunents where the INVADER and probe 
30 oUgonucleotides share a sequence overlap, this places the 3' terminal nucleotide of the 
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upstream oligonucleotide and the base of the probe oligonucleotide 5' of the cleavage site 
in competition for pairing with the corresponding nucleotide of the target strand. 

To examine the outcome of this competition (i.e. which base is paired during a 
successful cleavage event), substitutions were made in the probe and INVADER 
oligonucleotides such that either the probe or the INVADER oligonucleotide were 
mismatched with the target sequence at this position. The effects of both arrangements 
on the rates of cleavage were examined. When the INVADER oligonucleotide is 
unpaired at the 3' end, the rate of cleavage was not reduced. If this base was removed, 
however, the cleavage site was shifted upstream of the intended site. In contrast, if the 
probe oligonucleotide was not base-paired to the target just upstream of the site to which 
the INVADER oligonucleotide was directing cleavage, the rate of cleavage was 
dramatically reduced, suggesting that when a competition exists, the probe 
oligonucleotide was the molecule to be base-paired in this position. 

It appears that the 3' end of the upstream INVADER oligonucleotide is unpaired 
during cleavage, and yet is important for accurate positioning of the cleavage. To 
examine which part(s) of the 3' terminal nucleotide are required for the positioning of 
cleavage, INVADER oUgonucleotides were designed that terminated on this end with 
nucleotides that were altered in a variety of ways. Sugars examined included 2' 
deoxyribose with a 3' phosphate group, a dideoxyribose, 3' deoxyribose, T 0-methyl 
ribose, arabinose and arabinose with a 3' phosphate. Abasic ribose, with and without 3' 
phosphate were tested. Synthetic ''universal" bases such at 3-nitropyrrole and 5-3 
nitroindole on ribose sugars were tested. Finally, a base-like aromatic ring structure, 
acridine, linked to the 3' end the previous nucleotide without a sugar group was tested. 
The results obtained support the conclusion that the aromatic ring of the base (at the 3' 
end of the INVADER oligonuceotide) is an important moiety for accompHshing the 
direction of cleavage to the desired site within the downstream probe. The 3' terminal 
moiety of the INVADER oUgonucleotide need not be a base that is complementary to the 
target nucleic acid. 



79 



ii) M iniprobes And Mid-Range Probes; 

As discussed above, the INVADER oligonucleotide-directed cleavage assay may 
be performed using INVADER and probe oligonucleotides that have a length of about 
13-25 nucleotides (typically 20-25 nucleotides). It is also contemplated that the 
oHgonucleotides may themselves be composed of shorter oligonucleotide sequences that 
aUgn along a target strand but that are not covalently linked. This is to say that there is a 
nick in the sugar-phosphate backbone of the composite oUgonucleotide, but that there is 
no disruption in the progression of base-paired nucleotides in the resulting duplex. When 
short strands of nucleic acid align contiguously along a longer strand the hybridization of 
each is stabiUzed by the hybridization of the neighboring fragments because the basepairs 
can stack along the heHx as though the backbone was in fact uninterrupted. This 
cooperativity of binding can give each segment a stability of interaction in excess of what 
would be expected for the segment hybridizing to the longer nucleic acid alone. One 
appUcation of this observation has been to assemble primers for DNA sequencing, 
typically about 18 nucleotides long, from sets of three hexamer oHgonucleotides that are 
designed to hybridize in this way (Kotler et al Proc. Natl. Acad. Sci. USA 90:4241 
[1993]). The resulting doubly-nicked primer can be extended enzymatically in reactions 
performed at temperatures that might be expected to disrupt the hybridization of 
hexamers, but not of 18-mers. 

The use of composite or split oligonucleotides is applied with success in the 
INVADER-directed cleavage assay. For example, the probe oligonucleotide may be spHt 
into two oligonucleotides that anneal in a contiguous and adjacent manner along a target 
oligonucleotide such that the downstream ohgonucleotide (analogous to the probe) is 
assembled from two smaller pieces: a short segment of 6-10 nts (termed the 
"miniprobe"), that is to be cleaved in the course of the detection reaction, and an 
oligonucleotide that hybridizes immediately downstream of the miniprobe (termed the 
"stacker"), that serves to stabilize the hybridization of the probe. To form the cleavage 
structure, an upstream oUgonucleotide (the INVADER oligonucleotide) is provided to 
direct the cleavage activity to the desired region of the miniprobe. Assembly of the probe 
from non-linked pieces of nucleic acid {i.e,, the miniprobe and the stacker) allows regions 
of sequences to be changed without requiring the re-synthesis of the entire proven 
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sequence, thus improving the cost and flexibiUty of the detection system. In addition, the 
use of unlinked composite oligonucleotides makes the system more stringent in its 
requirement of perfectly matched hybridization to achieve signal generation, allowing 
this to be used as a sensitive means of detecting mutations or changes in the target nucleic 
5 acid sequences. 

In one embodiment, the methods of the present invention employ at least three 
oligonucleotides that interact with a target nucleic acid to form a cleavage structure for a 
structure-specific nuclease. More specifically, the cleavage structure comprises i) a 
target nucleic acid that may be either single-stranded or double-stranded (when a double- 
10 stranded target nucleic acid is employed, it may be rendered single-stranded, e,g. , by 

heating); ii) a first oUgonucleotide, termed the "stacker," that defines a first region of the 
rj target nucleic acid sequence by being the complement of that region.; iii) a second 
^f! oUgonucleotide, termed the "miniprobe," that defines a second region of the target 
Gil nucleic acid sequence by being the complement of that region ; iv) a third 
5 1 5 oUgonucleotide, termed the "INVADER oligonucleotide," the 5' part of which defines a 
%^ third region of the same target nucleic acid sequence, adjacent to and downstream of the 

II second target region, and the second or 3' part of which overlaps into the region defined 

III by the second oUgonucleotide As described above for embodiments that do not employ a 
g stacker, the overlap region can represent a region where there is a physical, but not 

20 sequence, overlap between the INVADER and probe oUgonucleotides. 

In addition to the benefits cited above, the use of a composite design for the 
oUgonucleotides that form the cleavage stracture allows more latitude in the design of the 
reaction conditions for performing the INVADER-directed cleavage assay. When a 

25 longer probe (e.g., 16-25 nt), as described above, is used for detection in reactions that 
are performed at temperatures below the of that probe, the cleavage of the probe may 
play a significant role in destabilizing the duplex of which it is a part, thus allowing 
turnover and reuse of the recognition site on the target nucleic acid. In contrast, reaction 
temperatures that are at or above the Tm of the probe mean that the probe molecules are 

30 hybridizing and releasing fi-om the target quite rapidly even without cleavage of the 

probe. When an upstream INVADER oUgonucleotide and a cleavage agent are provided 
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the probe will be specifically cleaved, but the cleavage will not be necessary to the 
turnover of the probe. When a long probe {e.g., 16-25 nt) is used in this way the 
temperatures required to achieve this state is high, around 65 to 70°C for a 25-mer of 
average base composition. Requiring the use of such elevated temperatures limits the 
choice of cleavage agents to those that are very thermostable, and may contribute to 
background in the reactions, depending of the means of detection, through thermal 
degradation of the probe oligonucleotides. With miniprobes, this latter mechanism of 
probe replacement may be accompUshed at a lower temperature. Thus, shorter probes are 
preferred for embodiments using lower reaction temperatures. 

The miniprobe of the present invention may vary in size depending on the desired 
appHcation. In one embodiment, the probe may be relatively short compared to a 
standard probe {e.g., 16-25 nt), in the range of 6 to 10 nucleotides. When such a short 
probe is used, reaction conditions can be chosen that prevent hybridization of the 
miniprobe in the absence of the stacker oUgonucleotide. In this way a short probe can be 
made to assume the statistical specificity and selectivity of a longer sequence. In the 
event of a perturbation in the cooperative binding of the miniprobe and stacker nucleic 
acids, as might be caused by a mismatch within the short sequence that is otherwise 
complementary to the target nucleic acid or at the junction between the contiguous 
duplexes, this cooperativity can be lost, dramatically reducing the stability of the shorter 
duplex {i.e., that of the miniprobe), and thus reducing the level of cleaved product in the 
assay of the present invention. 

It is also contemplated that probes of intermediate size may be used. Such probes, 
in the 1 1 to 15 nucleotide range, may blend some of the features associated with the 
longer probes as originally described, these features including the ability to hybridize and 
be cleaved absent the help of a stacker oUgonucleotide. At temperatures below the 
expected T^ of such probes, the mechanisms of turnover may be as discussed above for 
probes in the 20 nt range, and be dependent on the removal of the sequence in the overlap 
region for destabilization and cycling. 

The mid-range probes may also be used at elevated temperatures, at or above their 
expected Tm, to allow melting rather than cleavage to promote probe tumover. In 
contrast to the longer probes described above, however, the temperatures required to 
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allow the use of such a thermally driven turnover are much lower (about 40 to SO^'C), 
thus preserving both the cleavage means and the nucleic acids in the reaction from 
thermal degradation. In this way, the mid-range probes may perform in some instances 
like the miniprobes described above. In a further similarity to the miniprobes, the 
accumulation of cleavage signal from a mid-range probe may be helped under some 
reaction conditions by the presence of a stacker. 

To summarize, a standard long probe usually does not benefit from the presence 
of a stacker oligonucleotide downstream (the exception being cases where such an 
oligonucleotide may also disrupt structures in the target nucleic acid that interfere with 
the probe binding), and it may be used in conditions requiring several nucleotides to be 
removed to allow the oligonucleotide to release from the target efficiently. If temperature 
of the reaction is used to drive exchange of the probes, standard probes may require use 
of a temperature at which nucleic acids and enzymes are at higher risk of thermal 
degradation. 

The miniprobe is very short and performs optimally in the presence of a 
downstream stacker oUgonucleotide. The miniprobes are well suited to reactions 
conditions that use the temperature of the reaction to drive rapid exchange of the probes 
on the target regardless of whether any bases have been cleaved. In reactions with 
sufficient amount of the cleavage means, the probes that do bind will be rapidly cleaved 
before they melt off 

The mid-range or midiprobe combines features of these probes and can be used in 
reactions like those favored by long probes, with longer regions of overlapto drive probe 
turnover at lower temperature. In a preferred embodiment, the midrange probes are used 
at temperatures sufficiently high that the probes are hybridizing to the target and 
releasing rapidly regardless of cleavage. The mid-range probe may have enhanced 
performance in the presence of a stacker under some circumstances. 

The distinctions between the mini-, midi- (ue,, mid-range) and long probes are not 
contemplated to be inflexible and based only on length. The performance of any given 
probe may vary with its specific sequence, the choice of solution conditions, the choice of 
temperature and the selected cleavage means. 
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The assemblage of oligonucleotides that comprises the cleavage structure of the 
present invention is sensitive to mismatches between the probe and the target. It is also 
contemplated that a mismatch between the INVADER oligonucleotide and the target may 
be used to distinguish related target sequences. In the 3-oUgonucleotide system, 
comprising an INVADER, a probe and a stacker oligonucleotide, it is contemplated that 
mismatches may be located within any of the regions of duplex formed between these 
oligonucleotides and the target sequence. In a preferred embodiment, a mismatch to be 
detected is located in the probe. In a particularly preferred embodiment, the mismatch is 
in the probe, at the basepair immediately upstream (i.e., 5') of the site that is cleaved 
when the probe is not mismatched to the target. 

In another preferred embodiment, a mismatch to be detected is located within the 
region defined by the hybridization of a miniprobe. In a particularly preferred 
embodiment, the mismatch is in the miniprobe, at the basepair immediately upstream 
(Le., 5') of the site that is cleaved when the miniprobe is not mismatched to the target, 

iii) Software for Oligonucleotide Design for the INVADER assay 
The present invention provides systems and methods for the design of 
oHgonucleotides for use in detection assays. In particular, the present invention provides 
systems and methods for the design of oligonucleotides that successfully hybridize to 
appropriate regions of target nucleic acids (e.g., regions of target nucleic acids that do not 
contain secondary structure) under the desired reaction conditions (e.g., temperature, 
buffer conditions, etc.) for the detection assay. The systems and methods also allow for 
the design of multiple different oligonucleotides (e.g., oHgonucleotides that hybridize to 
different portions of a target nucleic acid or that hybridize to two or more different target 
nucleic acids) that all function in the detection assay under the same or substantially the 
same reaction conditions. These systems and methods may also be used to design control 
samples that work under the experimental reaction conditions. 

While the systems and methods of the present invention are not limited to any 
particular detection assay, the following description illustrates the invention when used in 
conjunction with the INVADER assay (Third Wave Technologies, Madison WI; See e.g., 
U.S. Pat Nos. 5,846,717, 5,985,557, 5,994,069, and 6,001,567 andPCT Publications WO 
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97/27214 and WO 98/42873, incorporated herein by reference in their entireties) to detect 
a SNP. One skilled in the art will appreciate that specific and general features of this 
illustrative example are generally appUcable to other detection assays, and for use in 
designing INVADER assays for purposes other than SNP detection (e.g., for DNA or 
RNA quantitation, for RNA spUce junction detection, etc.). Further, it will be 
appreciated that all algorithms described herein can be applied as separate software 
elements, or calculations may be performed manually, for the design of any INVADER 
assay probe set without use of the INVADERCREATOR design system described below. 

Oligonucleotide Design for the INVADER assay using the 
INVADERCREATOR program 

In some embodiments where an oUgonucleotide is designed for use in the 
INVADER assay to detect a SNP, the sequence(s) of interest are entered into the 
INVADERCREATOR program (Third Wave Technologies, Madison, WI). As described 
above, sequences may be input for analysis from any number of sources, either directly 
into the computer hosting the INVADERCREATOR program, or via a remote computer 
linked through a communication network (e.g., a LAN, Intranet or Intemet network). 
The program designs probes for both the sense and antisense strand. Strand selection is 
generally based upon the ease of synthesis, minimization of secondary structure 
formation, and manufacturability. In some embodiments, the user chooses the strand for 
sequences to be designed for. In other embodiments, the software automatically selects 
Ihe strand. By incorporating thermodynamic parameters for optimum probe cycling and 
signal generation (AUawi and SantaLucia, Biochemistry, 36:10581 [1997]), 
oUgonucleotide probes may be designed to operate at a pre-selected assay temperature 
(e.g., 63 ""C). Based on these criteria, a final probe set (e.g., primary probes for 2 alleles 
and an INVADER oligonucleotide) is selected. 

In some embodiments, the INVADERCREATOR system is a web-based program 
with secure site access that contains a link to BLAST (available at the National Center for 
Biotechnology Information, National Library of Medicine, National Institutes of Health 
web site) and that can be Unked to RNAstructure (Mathews et al, RNA 5:1458 [1999]), a 
software program that incorporates mfold (Zuker, Science, 244:48 [1989]). 
RNAstructure tests the proposed oligonucleotide designs generated by 
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INVADERCREATOR for potential uni- and bimolecular complex formation. 
INVADERCREATOR is open database connectivity (ODBC)-compliant and uses the 
Oracle database for export/integration. The INVADERCREATOR system was 
)■ ' configured with Oracle to work well with UNIX systems, as most genome centers are 

■• 5 UNIX-based. 

In some embodiments, the INVADERCREATOR analysis is provided on a 
separate server {e.g, , a Sun server) so it can handle analysis of large batch j obs. For 
example, a customer can submit up to 2,000 SNP sequences in one email. The server 
passes the batch of sequences on to the EsTVADERCREATOR software, and, when 
10 initiated, the program designs SNP sets. In some embodiments, probe set designs are 
retumed to the user within 24 hours of receipt of the sequences. 

Mis 

CI In some preferred embodiments, each INVADER assay reaction includes at least 

jSf two target sequence-specific, unlabeled ohgonucleotides for the primary reaction: an 
upstream INVADER oligonucleotide and a downstream Probe ohgonucleotide. The 
1 5 INVADER oligonucleotide is generally designed to bind stably at the reaction 

temperature, while the probe is designed to freely associate and disassociate with the 
y target strand, with cleavage occurring only when an uncut probe hybridizes adjacent to an 
m overlapping INVADER oligonucleotide. In some embodiments, the probe includes a 5' 
j£: flap or "arm" that is not complementary to the target, and this flap is released from the 
n 20 probe when cleavage occurs. In some embodiments, the released flap participates as an 
INVADER oligonucleotide in a secondary reaction. 

The following discussion provides one example of how a user interface for an 
INVADERCREATOR program may be configured. 

The user opens a work screen (Figure 42), e.g., by clicking on an icon on a 
25 desktop display of a computer {e.g,, a Windows desktop). The user enters information 
related to the target sequence for which an assay is to be designed. In some 
embodiments, the user enters a target sequence. In other embodiments, the user enters a 
code or number that causes retrieval of a sequence from a database. In still other 
embodiments, additional information may be provided, such as the user's name, an 
30 identifying number associated with a target sequence, and/or an order number. In 

preferred embodiments, the user indicates (e.g. via a check box or drop down menu) that 

86 



the target nucleic acid is DNA or RNA. In other preferred embodiments, the user 
indicates the species from which the nucleic acid is derived. In particularly preferred 
embodiments, the user indicates whether the design is for monoplex (i.e., one target 
sequence or allele per reaction) or multiplex (ie., multiple target sequences or alleles per 
5 reaction) detection. When the requisite choices and entries are complete, the user starts 
the analysis process. In one embodiment, the user chcks a "Go Design It" button to 
continue. 

In some embodiments, the software validates the field entries before proceeding. 
In some embodiments, the software verifies that any required fields are completed with 
• 10 the appropriate type of information. In other embodiments, the software verifies that the 
ti: input sequence meets selected requirements (e.g., minimimi or maximum length, DNA or 
5 RNA content). If entries in any field are not found to be valid, an error message or dialog 
HI box may appear. In preferred embodiments, the error message indicates which field is 
7i incomplete and/or incorrect. Once a sequence entry is verified, the software proceeds 

1 5 with the assay design, 
.r In some embodiments, the information suppUed in the order entry fields specifies 

y what type of design will be created. In preferred embodiments, the target sequence and 
W multiplex check box specify which type of design to create. Design options include but 
Q are not limited to SNP assay, Multiplexed SNP assay (e.g., wherein probe sets for 
' 20 different alleles are to be combined in a single reaction), Multiple SNP assay (e.g., 

wherein an input sequence has multiple sites of variation for which probe sets are to be 
designed), and Multiple Probe Arm assays. 

In some embodiments, the INVADERCREATOR software is started via a Web 
Order Entry (WebOE) process (i.e., through an Intra/Intemet browser interface) and these 
25 parameters are transferred from the WebOE via applet <param> tags, rather than entered 
through menus or check boxes. 

In the case of Multiple SNP Designs, the user chooses two or more desigus to 
work with. In some embodiments, this selection opens a new screen view (e.g., a 
Multiple SNP Design Selection view Figure 43). In some embodiments, the software 
30 creates designs for each locus in the target sequence, scoring each, and presents them to 
the user in this screen view. The user can then choose any two designs to work with. In 
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some embodiments, the user chooses a first and second design (e.g., via a menu or 
buttons) and clicks a "Go Design It" button to continue. 

To select a probe sequence that will perform optimally at a pre-selected reaction 
temperature, the melting temperature (Tm) of the SNP to be detected is calculated using 
the nearest-neighbor model and published parameters for DNA duplex formation (AUawi 
and SantaLucia, Biochemistry, 36:10581 [1997]), hi embodiments wherein the target 
strand is RNA, parameters appropriate for RNA/DNA heteroduplex formation may be 
used. Because the assay's salt concentrations are often different than the solution 
conditions in which the nearest-neighbor parameters were obtained (IM NaCl and no 
divalent metals), and because the presence and concentration of the enzyme influence 
optimal reaction temperature, an adjustment should be made to the calculated Ttn to 
determine the optimal temperature at which to perform a reaction. One way of 
compensating for these factors is to vary the value provided for the salt concentration 
within the melting temperature calculations. This adjustment is termed a 'salt correction'. 
As used herein, the term "salt correction" refers to a variation made in the value provided 
for a salt concentration for the purpose of reflecting the effect on a Txn calculation for a 
nucleic acid duplex of a non-salt parameter or condition affecting said duplex. Variation 
of the values provided for the strand concentrations will also affect the outcome of these 
calculations. By using a value of 0.5 M NaCl (SantaLucia, Proc Natl Acad Sci USA, 
95:1460 [1998]) and strand concentrations of about 1 mM of the probe and 1 fM target, 
the algorithm used for calculating probe-target melting temperature has been adapted for 
use in predicting optimal INVADER assay reaction temperature. For a set of 30 probes, 
the average deviation between optimal assay temperatures calculated by this method and 
those experimentally determined is about 1.5 °C. 

The length of the downstream probe analyte-specific region (ASR) is defined by 
the temperature selected for running the reaction (e.g., 63°C). Starting from the position 
of the variant nucleotide on the target DNA (the target base that is paired to the probe 
nucleotide 5' of the intended cleavage site), and adding on the 3' end, an iterative 
procedure is used by which the length of the target-binding region of the probe is 
increased by one base pair at a time until a calculated optimal reaction temperature (Tm 
plus salt correction to compensate for enzyme effect) matching the desired reaction 
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temperature is reached. The non-complementary arm of the probe is preferably selected 
to allow the secondary reaction to cycle at the same reaction temperature. The entire 
probe oUgonucleotide is screened using programs such as mfold (Zuker, Science, 244: 48 
[1989]) or Oligo 5.0 (Rychlik and Rhoads, Nucleic Acids Res, 17: 8543 [1989]) for the 
possible formation of dimer complexes or secondary structures that could interfere with 
the reaction. The same principles are also followed for INVADER oligonucleotide 
design. Briefly, starting from the position N on the target DNA, the 3' end of the 
INVADER oUgonucleotide is designed to have a nucleotide not complementary to either 
allele suspected of being contained in the sample to be tested. The mismatch does not 
adversely affect cleavage (Lyamichev et al. Nature Biotechnology, 17: 292 [1999]), and 
it can enhance probe cycling, presumably by minimizing coaxial stabilization effects 
between the two probes. Additional residues complementary to the target DNA starting 
from residue N-1 are then added in the 5' direction until the stability of the 
INVADERoUgonucleotide-target hybrid exceeds that of the probe (and therefore the 
planned assay reaction temperature), generally by 15-20 "^C. 

In some embodiments, the released cleavage fragment from a primary reaction is 
to be used in a secondary reaction. It is one aspect of the assay design that the all of the 
probe sequences may be selected to allow the primary and secondary reactions to occur at 
the same optimal temperature, so that the reaction steps can run simultaneously. In an 
altemative embodiment, the probes may be designed to operate at different optimal 
temperatures, so that the reaction steps are not simultaneously at their temperature 
optima. 

In some embodiments, the software provides the user an opportunity to change 
various aspects of the design including but not limited to: probe, target and INVADER 
oUgonucleotide temperature optima and concentrations; blocking groups; probe arms; 
dyes, capping groups and other adducts; individual bases of the probes and targets (e.g., 
adding or deleting bases from the end of targets and/or probes, or changing internal bases 
in the INVADER and/or probe and/or target oligonucleotides). In some embodiments, 
changes are made by selection from a menu. In other embodiments, changes are entered 
into text or dialog boxes. In preferred embodiments, this option opens a new screen (e.g., 
a Designer Worksheet view, Figure 44). 
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In some embodiments, the software provides a scoring system to indicate the 
quality (e.g., the likelihood of perforaiance) of the assay designs. In one embodiment, 
the scoring system includes a starting score of points 100 points) wherein the 
starting score is indicative of an ideal design, and wherein design features known or 

5 suspected to have an adverse affect on assay performance are assigned penalty values. 
Penalty values may vary depending on assay parameters other than the sequences, 
including but not limited to the type of assay for which the design is intended (e.g., 
monoplex, multiplex) and the temperature at which the assay reaction will be performed. 
The following example provides illustrative scoring criteria for use with some 

10 embodiments of the INVADER assay based on an intelligence defined by 

experimentation. Examples of design features that may incur score penalties include but 
are not limited to the following [penalty values are indicated in brackets, first number is 
for lower temperature assays (e.g., 62-64 °C), second is for higher temperature assays 
(e.g., 65-66 °C)]: 
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1. [100:100] 3' end of INVADER oligonucleotide resembles the probe arm: 
ARM SEQUENCE: PENALTY AWARDED IF INVADER 

rj OLIGONUCLEOTIDE ENDS IN: 

S Arml: CGCGCCGAGG 5'...GAGGX or 5'...GAGGXX 

S 20 Arm 2: ATGACGTGGCAGAC 5' . . .CAGACX or 5' . . .CAGACXX 

Arm 3: ACGGACGCGGAG 5\..GGAGX or 5'...GGAGXX 

Ami 4: TCCGCGCGTCC 5'...GTCCX or 5'...GTCCXX 

2. [70:70] a probe has 5-base stretch (i.e., 5 of the same base in a row) containing 
the polymorphism; 

25 3. [60:60] a probe has 5-base stretch adjacent to the polymorphism; 

4. [50:50] a probe has 5-base stretch one base from the polymorphism; 

5. [40:40] a probe has 5-base stretch two bases fi-om the polymorphism; 

6. [50:50] probe 5-base stretch is of Gs - additional penalty; 

7. [100:100] a probe has 6-base stretch anywhere; 

30 8, [90:90] a two or three base sequence repeats at least four times; 
9. [100:100] a degenerate base occurs in a probe; 
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10. [60:90] probe hybridizing region is short (13 bases or less for designs 65-67°C; 12 
bases or less for designs 62-64°C) 

1 1 . [40:90] probe hybridizing region is long (29 bases or more for designs 65-67°C, 28 
bases or more for designs 62-64°C) 

12. [5 :5] probe hybridizing region length - per base additional penalty 

13. [80:80] Ins/Del design with poor discrimination in first 3 bases after probe arm 

14. [100: 100] calculated INVADER oligonucleotide Tm within 7.5°C of probe target 
Tm (designs 65-67°C with INVADER oligonucleotide less than < 70.5°C, designs 62- 
64°C with INVADER oUgonucleotide < 69.5°C 

15. [20:20] calculated probes Tms differ by more than 2.0^C 

16. [100:100] aprobe has calculated Tm 2'^C less than its target Tm 

17. [10:10] target of one strand 8 bases longer than that of other strand 

18. [30:30] INVADER oUgonucleotide has 6-base stretch anywhere - initial penalty 

19. [70:70] INVADER oligonucleotide 6-base stretch is of Gs - additional penalty 

20. [15:15] probe hybridizing region is 14, 15 or 24-28 bases long (65-67°C) or 13,14 
or 26,27 bases long (62-64°C) 

21. [ 1 5 : 1 5] a probe has a 4-base stretch of Gs containing the polymorphism 

In particularly preferred embodiments, temperatures for each of the 
oUgonucleotides in the designs are recomputed and scores are recomputed as changes are 
made. In some embodiments, score descriptions can be seen by clicking a "descriptions" 
button. In some embodiments, a BLAST search option is provided. In preferred 
embodiments, a BLAST search is done by cUcking a "BLAST Design" button. In some 
embodiments, this action brings up a dialog box describing the BLAST process. In 
preferred embodiments, the BLAST search results are displayed as a highlighted design 
on a Designer Worksheet. 

In some embodiments, a user accepts a design by clicking an "Accept" button. In 
other embodiments, the program approves a design without user intervention. In 
preferred embodiments, the program sends the approved design to a next process step 
(e.g., into production; into a file or database). In some embodiments, the program 
provides a screen view (e.g., an Output Page, Figure 45), allowing review of the final 
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designs created and allowing notes to be attached to the design. In preferred 
embodiments, the user can return to the Designer Worksheet ie.g.,hy clicking a "Go 
Back" button) or can save the design (e.g., by clicking a "Save It" button) and continue 
(e,g., to submit the designed oHgonucleotides for production). 

In some embodiments, the program provides an option to create a screen view of 
a design optimized for printing (e.g, a text-only view) or other export (e.g., an Output 
view, Figure 46). In preferred embodiments, the Output view provides a description of 
the design particularly suitable for printing, or for exporting into another application (e.g, 
by copying and pasting into another appUcation). In particularly preferred embodiments, 
the Output view opens in a separate window. 

The present invention is not limited to the use of the INVADERCREATOR 
software. Indeed, a variety of software programs are contemplated and are commercially 
available, including, but not limited to GCG Wisconsin Package (Genetics computer 
Group, Madison, WT) and Vector NTI (Informax, Rockville, Maryland). 

b) Design Of The Reaction Conditions 

Target nucleic acids (e.g., RNA and DNA) that may be analyzed using the 
methods of the present invention that employ a 5' nuclease or other appropriate cleavage 
agents. Such nucleic acids may be obtained using standard molecular biological 
techniques. For example, nucleic acids (RNA or DNA) may be isolated from a tissue 
sample (e.g., a biopsy specimen), tissue culture cells, samples containing bacteria and/or 
viruses (including cultures of bacteria and/or viruses), etc. The target nucleic acid may 
also be transcribed in vitro from a DNA template or may be chemically synthesized or 
amplified in by polymerase chain reaction. Furthermore, nucleic acids may be isolated 
from an organism, either as genomic material or as a plasmid or similar 
extrachromosomal DNA, or they may be a fragment of such material generated by 
treatment with a restriction endonuclease or other cleavage agent, or a shearing force, or 
it may be synthetic. 

Assembly of the target, probe, and INVADER oligonucleotide nucleic acids into 
the cleavage reaction of the present invention uses principles commonly used in the 
design of oligonucleotide-based enzymatic assays, such as dideoxynucleotide sequencing 
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and polymerase chain reaction (PGR). As is done in these assays, the oUgonucleotides 
are provided in sufficient excess that the rate of hybridization to the target nucleic acid is 
very rapid. These assays ^e commonly performed with 50 finoles to 2 pmoles of each 
oligonucleotide per microliter of reaction mixture, although they are not necessarily 
5 Umited to this range. In the Examples described herein, amounts of oligonucleotides 
ranging firom 250 finoles to 5 pmoles per microliter of reaction volxmie were used. These 
values were chosen for the purpose of ease in demonstration and are not intended to limit 
the performance of the present invention to these concentrations. Other {e.g., lower) 
oligonucleotide concentrations commonly used in other molecular biological reactions 
10 are also contemplated. 

It is desirable that an INVADER oligonucleotide be immediately available to 
P direct the cleavage of each probe oligonucleotide that hybridizes to a target nucleic acid, 
y In some embodiments described herein, the INVADER oUgonucleotide is provided in 
excess over the probe oligonucleotide. While this is an effective means of making the 
y 15 INVADER oligonucleotide immediately available in such embodiments it is not intended 
that the practice of the present invention be limited to conditions wherein the INVADER 
Q oligonucleotide is m excess over the probe, or to any particular ratio of ENVADER-to- 
probe (e.g., in some preferred embodiments described herein, the probe is provided in 
2 excess over the INVADER ohgonucleotide). Another means of assuring the presence of 
II 20 an INVADER oUgonucleotide whenever a probe binds to a target nucleic acid is to 

design the INVADER oligonucleotide to hybridize more stably to the target, z.e., to have 
a higher Tm than the probe. This can be accompUshed by any of the means of increasing 
nucleic acid duplex stability discussed herein (e.g., by increasing the amount of 
complementarity to the target nucleic acid). 
25 Buffer conditions should be chosen that are compatible with both the 

ohgonucleotide/target hybridization and with the activity of the cleavage agent. The 
optimal buffer conditions for nucleic acid modification enzymes, and particularly DNA 
modification enzymes, generally included enough mono- and di-valent salts to allow 
association of nucleic acid strands by base-pairing. If the method of the present invention 
30 is performed using an enzymatic cleavage agent other than those specifically described 
here, the reactions may generally be performed in any such buffer reported to be optimal 
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for the nuclease function of the cleavage agent. In general, to test the utiUty of any 
cleavage agent in this method, test reactions are performed wherein the cleavage agent of 
interest is tested in the MOPS/MnCk/KCl buffer or Mg-containing buffers described 
herein and in whatever buffer has been reported to be suitable for use with that agent, in a 
5 manufacturer's data sheet, a journal article, or in personal communication. 

The products of the INVADER oligonucleotide-directed cleavage reaction are 
fragments generated by structure-specific cleavage of the input oligonucleotides. The 
resulting cleaved and/or uncleaved oUgonucleotides may be analyzed and resolved by a 
number of methods including, but not limited to, electrophoresis (on a variety of supports 
10 including acrylamide or agarose gels, paper, etc.), chromatography, fluorescence 
t,, polarization, mass spectrometry and chip hybridization. In some Examples the invention 
O is illustrated using electrophoretic separation for the analysis of the products of the 
li cleavage reactions. However, it is noted that the resolution of the cleavage products is 
5 not limited to electrophoresis. Electrophoresis is chosen to illustrate the method of the 
W 1 5 invention because electrophoresis is widely practiced in the art and is easily accessible to 
=r the average practitioner. In other Examples, the invention is illustrated without 
;2 electrophoresis or any other resolution of the cleavage products. 

m 

n The probe and INVADER oligonucleotides may contain a label to aid in their 

fit 3? 

S detection following the cleavage reaction. The label may be a radioisotope (e.g., a or 
^ 20 ^^S-labelled nucleotide) placed at either the 5' or 3' end of the oligonucleotide or 
altematively, the label may be distributed throughout the oUgonucleotide (i.e., a 
uniformly labeled oUgonucleotide). The label may be a nonisotopic detectable moiety, 
such as a fluorophore, that can be detected directly, or a reactive group that permits 
specific recognition by a secondary agent. For example, biotinylated oHgonucleotides 
25 may be detected by probing with a streptavidin molecule that is coupled to an indicator 
(e.g., alkaline phosphatase or a fluorophore) or a hapten such as dioxigenin may be 
detected using a specific antibody coupled to a similar indicator. The reactive group may 
also be a specific configuration or sequence of nucleotides that can bind or otherwise 
interact with a secondary agent, such as another nucleic acid, and enzyme, or an antibody. 
30 In some embodiments, a probe is labeled with fluorescing moiety and a quenching 

moiety, wherein cleavage of the cleavage structure separates the fluorescing moiety fi-om 
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the quenching moiety, resulting in a detectable signal (e.g., FRET detection). In some 
embodiments, a change in quenching of signal from a donor fluorophor is detected, while 
in other embodiments, a change in emission from an acceptor fluorophore is detected. In 
still other embodiments, the effect of FRET on both donor and acceptor emissions are 

detected. 

In some embodunents of FRET detection, the fluorescence lifetime of the 
fluorescence emitter is measured {e.g., as in time-resolved fluorescence). While not 
limiting time-resolved fluorescence detection embodiments to any particular labeling 
systems, examples of tags that are useftd in time-resolved FRET measurements include 
europium chelate (Eu^^; Biosclair, etal, J. Biomolecular Screening 5(5):319 [2000]), 
europium trisbipyridine cryptate (TBPEu^"^; Alpha-Bazin, et al. Anal. Biochem. 
286(1): 17 [2000]), and ruthenium Ugand complex {[Ru(bpy)2(phen-ITC)]^^; Youn, et al. 
Anal. Biochem. 232(1):24 [1995]; Lakowicz, et al.. Anal. Biochem. 288:62 [2001]). 

c) Optimization Of Reaction Conditions 

The INVADER ohgonucleotide-directed cleavage reaction is useftil to detect the 
presence of specific nucleic acids. In addition to the considerations Usted above for the 
selection and design of the INVADER and probe oligonucleotides, the conditions under 
which the reaction is to be performed may be optimized for detection of a specific target 
sequence. 

One objective in optimizing the INVADER oligonucleotide-directed cleavage 
assay is to allow specific detection of the fewest copies of a target nucleic acid. To 
achieve this end, it is desirable that tiie combined elements of the reaction interact with 
the maximum efficiency, so that the rate of the reaction (e.g., the number of cleavage 
events per minute) is maximized. Elements contributing to the overall efficiency of the 
reaction include the rate of hybridization, the rate of cleavage, and the efficiency of the 
release of the cleaved probe. 

The rate of cleavage will be a function of the cleavage means chosen, and may be 
made optimal according to the manufacturer's instructions when using commercial 
preparations of enzymes or as described in the examples herein. The other elements (rate 
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of hybridization, efficiency of release) depend upon the execution of the reaction, and 
optimization of these elements is discussed below. 

Three elements of the cleavage reaction that significantly affect the rate of nucleic 
acid hybridization are the concentration of the nucleic acids, the temperature at which the 
cleavage reaction is performed and the concentration of salts and/or other charge- 
shielding ions in the reaction solution. 

The concentrations at which oligonucleotide probes are used in assays of this type 
are well known in the art, and are discussed above. One example of a common approach 
to optimizing an oligonucleotide concentration is to choose a starting amount of 
oligonucleotide for pilot tests; 0.01 to 2 |aM is a concentration range used in many 
oligonucleotide-based assays. When initial cleavage reactions are performed, the 
following questions may be asked of the data: Is the reaction performed in the absence of 
the target nucleic acid substantially free of the cleavage product?; Is the site of cleavage 
specifically positioned in accordance with the design of the INVADER oligonucleotide?; 
Is the specific cleavage product easily detected in the presence of the uncleaved probe (or 
is the amount of uncut material overwhehning the chosen visualization method)? 

A negative answer to any of these questions would suggest that the probe 
concentration is too high, and that a set of reactions using serial dilutions of the probe 
should be performed until the appropriate amount is identified. Once identified for a 
given target nucleic acid in a give sample type (e.g., purified genomic DNA, body fluid 
extract, lysed bacterial extract), it should not need to be re-optimized. The sample type is 
important because the complexity of the material present may influence the probe 
concentration optimum. 

Conversely, if the chosen initial probe concentration is too low, the reaction may 
be slow, due to inefficient hybridization. Tests with increasing quantities of the probe 
will identify the point at which the concentration exceeds the optimum (e.g., at which it 
produces an undesirable effect, such as background cleavage not dependent on the target 
sequence, or interference with detection of the cleaved products). Since the hybridization 
will be facilitated by excess of probe, it is desirable, but not required, that the reaction be 
performed using probe concentrations just below this point. 
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The concentration of ESTVADER oligonucleotide can be chosen based on the 
design considerations discussed above. In some embodiments, the INVADER 
ohgonucleotide is in excess of the probe ohgonucleotide. In a preferred embodiment, the 
probe oligonucleotide is in excess of the INVADER ohgonucleotide. 
5 Temperature is also an important factor in the hybridization of ohgonucieotides. 

The range of temperature tested will depend in large part on the design of the 
ohgonucieotides, as discussed above. Where it is desired to have a reaction be run at a 
particular temperature (e.g., because of an enzyme requirement, for convenience, for 
compatibility with assay or detection apparatuses, etc.), the ohgonucieotides that function 
10 in the reaction can be designed to optimally perform at the desired reaction temperature. 
Each INVADER reaction includes at least two target sequence-specific ohgonucieotides 
b for the primary reaction: an upstream INVADER oligonucleotide and a downstream 

probe oligonucleotide. In some preferred embodiments, the INVADER ohgonucleotide 
'I? is designed to bmd stabily at the reaction temperature, while the probe is designed to 
W 15 freely associate and disassociate with the target strand, with cleavage occurring only 
^ when an uncut probe hybridizes adjacent to an overlapping INVADER ohgonucleotide. 
0 In preferred embodiments, the probe includes a 5' flap that is not complementary to the 
fl target, and this flap is released from the probe when cleavage occui*s. The released flap 
can be detected directly or indirectly. In some preferred embodiments, as discussed in 
n 20 detail below, the released flap participate as in INVADER ohgonucleotide in a secondary 
reaction. 

Optimum conditions for the INVADER assay are generally those that allow 
specific detection of the smallest amount of a target nucleic acid. Such conditions may 
be characterized as those that yield the highest target-dependent signal in a given 
25 timeframe, or for a given amount of target nucleic acid, or that allow the highest rate of 
probe cleavage (i.e., probes cleaved per minute). 

As noted above, the concentration of the cleavage agent can affect the actual 
optimum temperature for a cleavage reaction. Additionally, different cleavage agents, 
even if used at identical concentrations, can affect reaction temperature optima differently 
30 (e.g. , the difference between the calculated probe Tm and the observed optimal reaction 
temperature may be greater for one enzyme than for another). Determination of 
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appropriate salt corrections for reactions using different enzymes or concentrations of 
enzymes, or for any other variation made in reaction conditions, involves a two step 
process of a) measuring reaction temperature optima under the new reaction conditions, 
and varying the salt concentration within the Tm algorithm to produce a calculated 
temperature matching or closely approximating the observed optima. Measurement of an 
optimum reaction temperature generally involves performing reactions at a range of 
temperatures selected such that the range allows observation of an increase in 
performance as an optimal temperature is approached (either by increasing or decreasing 
temperatures), and a decrease in performance when an optimal temperature has been 
passed, thereby allowing identification of the optimal temperature or temperature range 
(See e.g., Lyamichev, et al., Biochemistry 39: 9523 [2000]). 

In some embodiments, a secondary reaction is used where the released cleavage 
fragment from a primary reaction hybridizes to a synthetic cassette to form a secondary 
cleavage reaction. In some preferred embodiments, the cassette comprises a fluorescing 
moiety and a quenching moiety, wherein cleavage of the secondary cleavage structure 
separates the fluorescing moiety from the quenching moiety, resulting in a detectable 
signal (e.g., FRET detection). The secondary reaction can be configured a number of 
different ways. For example, in some embodiments, the synthetic cassette comprises two 
oUgonucleotides: an oligonucleotide that contains the FRET moieties and a 
FRET/INVADER oUgonucleotide bridging oUgonucleotide that allows the INVADER 
oUgonucleotide (i.e., the released flap from the primary reaction) and the FRET 
oUgonucleotide to hybridize thereto, such that a cleavage structure is formed. In some 
embodiments, the synthetic cassette is provided as a single oUgonucleotide, comprising a 
hairpin structure (i.e., the FRET oUgonucleotide is connected at its 3' end to the bridging 
oUgonucleotide by a loop). The loop may be nucleic acid, or a non-nucleic acid spacer or 
linker. The linked molecules may together be described as a FRET cassette. Li the 
secondary reaction using a FRET cassette the released flap from the primary reaction, 
which acts as an INVADER oUgonucleotide, should be able to associate and disassociate 
with the FRET cassette freely, so that one released flap can direct the cleavage of 
multiple FRET cassettes. It is one aspect of the assay design that all of the probe 
sequences may be selected to allow the primary and secondary reactions to occur at the 
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same optimal temperature, so that the reaction steps can run simultaneously. In an 
altemative embodiment, the probes may be designed to operate at different optimal 
temperatures, so that the reaction steps are not simultaneously at their temperature 
optima. As noted above, the same iterative process used to select the ASR of the probe 
can be used in the design of the portion of the primary probe that participates in a 
secondary reaction. 

Another determinant of hybridization efficiency is the salt concentration of the 
reaction. In large part, the choice of solution conditions will depend on the requirements 
of the cleavage agent, and for reagents obtained commercially, the manufacturer's 
instructions are a resource for this information. When developing an assay utiUzing any 
particular cleavage agent, the oligonucleotide and temperature optimizations described 
above should be performed in the buffer conditions best suited to that cleavage agent. 

In some embodiments, additional agents may be included in reaction mixtures to 
enhance assay performance. For example, charged compounds such as aminoglycosides 
and other polyamines have been used to modulate DNA and RNA conformation and 
function (see, e.g., Eamshaw and Gait, Nucl. Acids Res. 26:5551 [1998]; Robinson and 
Wang, Nucl. Acids Res. 24:676 [1996]; Jerinie, J. Mol. Biol. 304(5):707 [2000]; 
Schroeder et al, EMBO 19(1):1 [2000]). hiclusion of the aminoglycoside antibiotic 
neomycin sulfate (e.g., at 1 ^iM in a primary reaction) can enhance assay performance by, 
e.g., reducing background signal, and therefore reducing the limit of detection of a 
particular INVADER assay probe set. Compounds of this type that may find use in 
INVADER assay reactions include, but are not limited to, aminoglycosides, oligomerized 
aminoglycosides, and aminoglycoside bioconjugates, and other polyanions including, but 
not limited to, spermine and hexaamine cobalt. 

A "no enzyme" control allows the assessment of the stability of the labeled 
oligonucleotides under particular reaction conditions, or in the presence of the sample to 
be tested e.g., in assessing the sample for contaminating nucleases). In this manner, the 
substrate and oUgonucleotides are placed in a tube containing all reaction components, 
except the enzyme and treated the same as the enzyme-containing reactions. Other 
controls may also be included. For example, a reaction with all of the components except 
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the target nucleic acid will serve to confirm the dependence of the cleavage on the 
presence of the target sequence. 

d) Selection of a Cleavage Agent 

As demonstrated in a number of the Examples, some 5* nucleases do not require 
5 an upstream oUgonucleotide to be active in a cleavage reaction. Although cleavage may 
be slower without the upstream oligonucleotide, it may still occur (Lyamichev et aL, 
Science 260:778 [1993], Kaiser et al, J. Biol Chem., 274:21387 [1999]). When a DNA 
strand is the template or target strand to which probe oligonucleotides are hybridized, the 
5' nucleases derived fi-om DNA polymerases and some flap endonucleases (FENs), such 
10 as that firom Methanococcus jannaschii, can cleave quite well without an upstream 
i-I oligonucleotide providing an overlap (Lyamichev et al.. Science 260:778 [1 993], Kaiser 
O et al., J. Biol. Chem., 274:21387 [1999], and US Patent No. 5,843,669, herein 
S. incorporated by reference in its entirety). These nucleases may be selected for use in 
['J some embodiments of the INVADER assay, e.g., in embodiments wherein cleavage of 
# 1 5 the probe in the absence of an INVADER oligonucleotide gives a different cleavage 
p product, which does not interfere with the intended analysis, or wherein both types of 
Jl cleavage, INVADER oUgonucleotide-directed and INVADER oligonucleotide- 
® independent, are intended to occur. 

S| In other embodiments it is preferred that cleavage of the probe be dependent on 

20 the presence of an upstream INVADER oligonucleotide, and enzyme having this 

requirement would be used. Other FENs, such as those from Archeaoglobus fulgidus 
(Afii) and Pyrococcus furiosus (Pfu), cleave an overlapped structure on a DNA target at 
so much greater a rate than they do a non-overlapping structure (z.e., either missing the 
upstream oligonucleotide or having a non-overlapping upstream oligonucleotide) that 
25 they can be viewed as having an essentially absolute requirement for the overlap 
(Lyamichev et al., Nat. Biotechnol., 17:292 [1999], Kaiser et al, J. Biol. Chem., 
274:21387 [1999]). When an RNA target is hybridized to DNA oligonucleotide probes 
to form a cleavage structure, many FENs cleave the downstream DNA probe poorly, 
regardless of the presence of an overlap. On such an RNA-containing structure, the 5' 
30 nucleases derived from DNA polymerases have a strong requirement for the overlap, and 
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are essentially inactive in its absence. The selection of enymes for use in the detection of 
RNA targets is discussed in more detail below, in Section IV: Improved Enzymes For 
Use In INVADER Ohgonucleotide-Directed Cleavage Reactions Comprising RNA 
Targets. 

e) Probing For Multiple Alleles 

The INTVADER oHgonucleotide-directed cleavage reaction is also useful in the 
detection and quantification of individual variants or alleles in a mixed sample 
population. By way of example, such a need exists in the analysis of tumor material for 
mutations in genes associated with cancers. Biopsy material from a tumor can have a 
significant complement of normal cells, so it is desirable to detect mutations even when 
present in fewer than 5% of the copies of the target nucleic acid in a sample. In this case, 
it is also desirable to measure what fraction of the population carries the mutation. 
Similar analyses may also be done to examine allelic variation in other gene systems, and 
it is not intended that the method of the present invention by hmited to the analysis of 
tumors. 

As demonstrated below, in one embodiment, reactions can be perfomied under 
conditions that prevent the cleavage of probes bearing even a single-nucleotide difference 
mismatch, but that permit cleavage of a similar probe that is completely complementary 
to the target in this region. In a preferred embodiment, a mismatch is positioned at the 
nucleotide in the probe that is 5' of the site where cleavage occurs in the absence of the 
mismatch. 

In other embodiments, the INVADER assay may be performed under conditions 
that have a tight requirement for an overlap (e.g., using the Afu FEN for DNA target 
detection or the 5' nuclease of DNA polymerase for RNA target detection, as described 
above), providing an alternative means of detecting single nucleotide or other sequence 
variations. In one embodiment, the probe is selected such that the target base suspected 
of varying is positioned at the 5' end of the target-complementary region of this probe. 
The upstream INYADER oligonucleotide is positioned to provide a single base of 
overlap. If the target and the probe ohgonucleotide are complementary at the base in 
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question, the overlap forms and cleavage can occur. However, if the target does not 
complement the probe at this position, that base in the probe becomes part of a non- 
complementary 5' arm, no overlap between the INVADER oligonucleotide and probe 
oligonucleotide exists, and cleavage is suppressed. 
5 It is also contemplated that different sequences may be detected in a single 

reaction. Probes specific for the different sequences may be differently labeled. For 
example, the probes may have different dyes or other detectable moieties, different 
lengths, or they may have differences in net charges of the products after cleavage. When 
V! differently labeled in one of these ways, the contribution of each specific target sequence 
10 to final product can be taUied, This has appUcation in detecting the quantities of different 
versions of a gene within a mixture. Different genes in a mixture to be detected and 
p quantified may be wild type and mutant genes (e.g., as may be found in a tumor sample, 

i| such as a biopsy). In this embodiment, one might design the probes to precisely the same 
: J site, but one to match the wild-type sequence and one to match the mutant. Quantitative 
\4 15 detection of the products of cleavage fi-om a reaction performed for a set amount of time 

will reveal the ratio of the two genes in the mixture. Such analysis may also be 
Q performed on unrelated genes in a mixture. This type of analysis is not intended to be 

1% limited to two genes. Many variants within a mixture may be similarly measured, 

^ Alternatively, different sites on a single gene may be monitored and quantified to 

ill 20 verify the measurement of that gene. In this embodiment, the signal fi*om each probe 
would be expected to be the same. 

It is also contemplated that multiple probes may be used that are not differently 
labeled, such that the aggregate signal is measured. This may be desirable when using 
many probes designed to detect a single gene to boost the signal from that gene. This 
25 configuration may also be used for detecting unrelated sequences within a mix. For 

example, in blood banking it is desirable to know if any one of a host of infectious agents 
is present in a sample of blood. Because the blood is discarded regardless of which agent 
is present, different signals on the probes would not be required in such an application of 
the present invention, and may actually be undesirable for reasons of confidentiality. 
30 Just as described for the two-oligonucleotide system, above, the specificity of the 

detection reaction will be influenced by the aggregate length of the target nucleic acid 
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sequences involved in the hybridization of the complete set of the detection 
oligonucleotides. For example, there may be applications in which it is desirable to 
detect a single region within a complex genome. In such a case the set of 
ohgonucleotides may be chosen to require accurate recognition by hybridization of a 
5 longer segment of a target nucleic acid, often in the range of 20 to 40 nucleotides. In 
other instances it may be desirable to have the set of ohgonucleotides interact with 
multiple sites within a t^get sample. In these cases one approach would be to use a set 
of oligonucleotides that recognize a smaller, and thus statistically more common, 
segment of target nucleic acid sequence. 
10 In one preferred embodiment, the INVADER and stacker oligonucleotides may be 

designed to be maximally stable, so that they will remain bound to the target sequence for 

0 extended periods during the reaction. This may be accomphshed through any one of a 

Q 

number of measures well known to those skilled in the art, such as adding extra 
hybridizing sequences to the length of the ohgonucleotide (up to about 50 nts in total 

1 jf 1 5 length), or by using residues with reduced negative charge, such as phosphorothioates or 
7 peptide-nucleic acid residues, so that the complementary strands do not repel each other 
O to degree that natural strands do. Such modifications may also serve to make these 

f^ flanking ohgonucleotides resistant to contaminating nucleases, thus further ensuring their 

m. 

^ continued presence on the target strand during the course of the reaction. In addition, the 
t^ 20 INVADER and stacker ohgonucleotides may be covalently attached to the target (e.g., 
through the use of psoralen cross-linking). 

f) Applications for pooled DNA and RNA samples 

In some embodiments, the present invention provides methods and kits for 
assaying a pooled sample using INVADER detection reagents (e.g. primary probe, 
25 INVADER probe, and FRET cassette). In some preferred embodiments, the kit 

comprises instructions on how to perform the INVADER assay and specifically how to 
apply the INVADER detection assay to pooled samples from many individuals, or to 
"pooled" samples firom many cells (e.g. from a biopsy sample) firom a single subject. 
In particular embodiments, the present invention allows detection of 
30 polymorphims in pooled samples combined from many individuals in a population (e.g. 
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10, 50, 100, or 500 individuals), or from a single subject where the nucleic acid 
sequences are from a large nxmiber of cells that ^e assayed at once. In this regard, the 
present invention allows the frequency of rare mutations in pooled samples to be detected 
and an allele frequency for the population estabUshed. In some embodiments, this allele 
frequency may then be used to statistically analyze the results of applying the INVADER 
detection assay to an individual's frequency for the polymorphism (e,g. determined using 
the INVADER assay). In this regard, mutations that rely on a percent of mutants found 
(e.g. loss of heterozygozity mutations) may be analyzed, and the severity of disease or 
progression of a disease determined (See, e.g. US Patent 6,146,828 and 6,203,993 to 
Lapidus, hereby incorporated by reference for all purposes, where genetic testing and 
statistical analysis are employed to find disease causing mutations or identify a patient 
sample as containing a disease causing mutations). 

In some embodiments of the present invention, broad population screens are 
performed. In some preferred embodiments, pooling DNA from several hundred or a 
thousand individuals is optimal. In such a pool, for example, DNA from any one 
individual would not be detectable, and any detectable signal would provide a measure of 
frequency of the detected allele in a broader population. The amount of DNA to be used, 
for example, would be set not by the number of individuals in a pool, but rather by the 
allele frequency to be detected. For example, in some embodiments, an assay gives 
ample signal from 20 to 40 ng of DNA in a 90 minute reaction. At this level of 
sensitivity, analysis of 1 |ag of DNA from a high-complexity pool would produce 
comparable signal from alleles present in only about 3-5% of the population. 

g) Applications of RNA detection. 

RNA quantitation is becoming increasingly important in basic, pharmaceutical, 
and clinical research. For example, quantitation of viral RNAs can predict disease 
progression and therapeutic efficacy. Likewise, gene expression analysis of diseased vs. 
normal, or untreated vs. treated, tissue can identify relevant biological responses or assess 
the effects of pharmacological agents. As the focus of the Human Genome Project 
moves toward gene expression analysis, the field will require a flexible RNA analysis 
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technology that can quantitatively monitor multiple forms of altematively transcribed 
and/or processed RNAs. 

As decribed above for the detection of multiple alleles, multiplex formats of the 
RNA INVADER assay enable simultaneous expression analysis of two or more genes 
5 within the same sample. In a primary reaction, one-nucleotide overlap-substrates are 
generated by the hybridization of INVADER oligonucleotides and probe ohgonucleotides 
to their respective RNA targets. Each probe contains a specific, target-complementary 
region and a distinctive non-complementary 5' flap that is associated only with that 
specific mRNA in that assay. The distinctive flaps may be distinguished in any of the 
10 myriad ways disclosed herein (e.g., with different labels, different secondary cleavage 
E systems having different labels, specific antibodies, different sizes when resolved, 

CI differenct sequences detected by hybridization in solution or on surfaces, etc.) 
m While the RNA invasive cleavage assay, like the method used for DNA detection 

JSJ described above, can use two invasive cleavage reactions in sequence (described below, 
IM 15 in Section n of the Detailed Description of the Invention), its preference for the 5' 
J ' nucleases derived fi-om DNA polymerases (described in detail in Section IV of the 
iJj Detailed Description of the Invention) indicates that additional format changes are 
f| preferred. Unlike the FEN 5' nucleases generally used for detection of DNA targets, 
p optimal signal amplification with the DNA Pol-related 5' nucleases occurs only when a 
20 probe turnover mechanism is employed in both the primary and secondary reactions (in 
contrast to an INVADER oligonucleotide turnover mechanism, wherein an INVADER 
oligonucleotide cycles, e.g., to direct the cleavage of multiple FRET cassettes, as 
described below, in Section II of this Detailed Description). Consequently, in preferred 
embodiments, RNA detection uses sequential operation of the two reactions, rather than 
25 simultaneous reaction performance. Because the reactions are performed truly 

sequentially, in these embodiments, the RNA INVADER assay signal accumulates 
linearly in both a target- and time-dependent manner. In contrast, the primary and 
secondary reactions of the DNA INVADER assay, when run concurrently, amplify signal 
as a linear fimction of target level, but as a quadratic fimction of time. In the sequential 
30 embodiments, the RNA INVADER assay uses two separate oligonucleotides, a secondary 
probe (e.g., a FRET probe) and secondary target, for signal generation. 
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A key feature of the RNA invasive cleavage assay is its ability to discriminate 
highly homologous RNA sequences, such as those found in cytochrome P450 gene 
famiUes. Like the DNA ESTVADER assay, the RNA INVADER assay can discriminate 
single-base changes. Li some embodiments, the first 5' complementary base of each 
5 probe is positioned at a non-conserved site in its mRNA target, so that a mismatch 
prevents formation of the overlap-structure, and thus prevents cleavage of the probe. 
Altematively spliced mRNA variants can be specifically detected by positioning the 
cleavage site at a sphce junction. 

To monitor large changes in mRNA levels, the dynamic raage of the assay can be 
10 extended using real-time analysis. However, since the assay generates signal linearly 
with time or target level, simply varying the amount of sample added per reaction and 
calculating the copies of mRNA per ng total RNA enables accurate quantitation with a 

El 

p single endpoint measurement on low-cost instrumentation. Further, in cases where 

absolute quantitation is not necessary, the assays linear signal amphfication mechanism 

& 15 and reproducibility also eliminate the need for a standard curve and enable simple and 

Ml ... . . ^ 

i|i precise relative quantitation of any one gene. 

JU. The RNA INVADER assay is particularly suited for detecting altematively 

fM sphced or edited RNA variants because even a single base change at the overlap site 

@ affects 5' nuclease cleavage. All areas requiring RNA quantitation, such as high- 

:p: 20 throughput screening in drug discovery research, monitoring drug metabolism and safety 

It 

in clinical trials, and clinical load monitoring of viral RNA can use this technology. 
Splice variants can be monitored in at least two ways with the assay: 1) detection of an 
individual exon or 2) detection of a specific spUce junction. 

To examine an RNA population for variants having more or fewer exons after 

25 splicing, INVADER assay probe sets are designed for each of the exons of interest (or for 
all exons in the mature RNA). Quantitation of exons, independent of how many mRNAs 
they reside in, may provide information about the number of splice variants for a given 
gene, as well as indicate the levels of expression for each exon. Mini in vitro transcripts 
containing only one or a few exons can be generated for each probe set so that absolute 

30 quantitation can be performed for each exon, thus enabling accurate comparisons of exon 
levels. If it is known that a particular exon is present in all known variants, in some 
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embodiments, a probe set is designed for that exon for use as an internal control to 
normalize across different samples. RNAs having a one copy of each exon (e.g., 
"normally spliced" RNA) should produce signal from the collection of probe sets in 
certain relative amounts (which should be esseintially equal for all exons, corrected for 
5 variations in the sensitivity of individual probe sets; see Section V). Alterations in 
splicing alter the relative amounts of the exons. For example, if all of the produced 
RNAs are missing one of the normal exons, the signal for that exon drops toward zero, 
while if half of the RNAs are missing that exon, the signal for that exon drops toward 
50%. More complex combinations of splice variations and mixtures of differently 
> 10 sphced mRNAs yield more complex and more informative profiles. Detection is not 
limited to exons. RNA populations may also be monitored for the presence of intron 

I I: 

m sequences that are usually removed by splicing. Such global exon screening provides 

y biologically relevant or diagnostic information when comparing normal vs. diseased 

'St 

tissue or untreated vs. treated cells (e.g., in pharmacogenomic screening assays). An 
y 15 array-based description of this type of measurement is referred to as alternative splicing 

detector airays (D. Black, Cell 103:367 [2000]). It is contemplated that the mRNA 
13 INVADER assay gives similar results but with greater specijficity and more accurate 
f| quantitation than the oligonucleotide array formats. 

J In an alternative embodiment, alternatively spliced mRNAs is detected by 

fU 20 examiniation of specific splice junctions. The advantage of monitoring the splice sites, as 
opposed to the exons themselves, is that even splice variants involving very small exons 
(e.g. < 10 nts) are accurately detected with the assay. 

Additionally, in some embodiments, the mRNA INVADER assay isalso used to 
monitor altemative start and stop sites in the mRNA, and is used to monitor lifetimes of 
25 processed and unprocessed RNAs and RNA fragments (e.g., as used in timecourse 
studies following induction). 

II. Signal Enhancement By Incorporating The Products Of An Invasive 
Cleavage Reaction Into A Subsequent Invasive Cleavage Reaction 

30 
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As noted above, the oligonucleotide product released by the invasive cleavage can 
be used subsequently in any reaction or read-out method that uses oligonucleotides in the 
size range of a cleavage product. In addition to the reactions involving primer extension 
and transcription, described herein, another enzymatic reaction that makes use of 
5 oligonucleotides is the invasive cleavage reaction. The present invention provide means 
of using the oUgonucleotide released in a primary invasive cleavage reaction as a 
component to complete a cleavage structure to enable a secondary invasive cleavage 
reaction. ITis not intended that the sequential use of the invasive cleavage product be 
" Umited to a single additional step. It is contemplated that many distinct invasive cleavage 

' 1 0 reactions may be performed in sequence. 

The polymerase chain reaction uses a DNA replication method to create copies of 
N= a targeted segment of nucleic acid at a logarithmic rate of accumulation. This is made 

P| possible by the fact that when the strands of DNA are separated, each individual strand 
^ contains sufficient information to allow assembly of a new complementary strand. When 
01 1 5 the new strands are synthesized the inmiber of identical molecules has doubled. Within 
Ji 20 iterations of this process, the original may be copied 1 milhon-fbld, making very rare 
1^ sequences easily detectable. The mathematical power of a doubUng reaction has been 
f| incorporated into a number of amplification assays. 

i«| By performing multiple, sequential invasive cleavage reactions the method of the 

gl 20 present invention c^tures an exponential mathematical advantage without producing 
additional copies of the target analyte. In a simple invasive cleavage reaction the yield, 
Y, is simply the tumover rate, K, multipHed by the time of the reaction, t {Le., Y = 
(K)(t)). If Y is used to represent the yield of a simple reaction, then the yield of a 
compound (i.e,, a multiple, sequential reaction), assuming that each of the individual 
25 invasive cleavage steps has the same tumover rate, can be simply represented as Y", 
where n is the number of invasive cleavage reactions that have been performed in the 
series. If the yields of each step differ the ultimate yield can be represented as the 
product of the multiplication of the yields of each individual reaction in the series. For 
example, if a primary invasive cleavage reaction can produce one thousand products in 
30 30 minutes, and each of those products can in turn participate in 1000 additional 

reactions, there will be 1000^ copies (1000 x 1000) of the ultimate product in a second 

108 



■3 

reaction. If a third reaction is added to the series, then the theoretical yield will be 1000 
(1000 X 1000 X 1000). In the methods of the present invention the exponent comes from 
the number of invasive cleavage reactions in the cascade. This can be contrasted to the 
amplification methods described above (e.g. , PGR) in which Y is Umited to 2 by the 
5 number of strands in duplex DNA, and the exponent n is the number of cycles performed, 
so that many iterations are necessary to accumulate large amounts of product. 

To distinguish the exponential amplifications described above from those of the 
present invention, the former can be considered reciprocating reactions because the 
products the reaction feed back into the same reaction (e.g., event one leads to some 
' 10 number of events 2, and each event 2 leads back to some number of events 1). In 

contrast, the events in some embodiments of the present invention are sequential (e.g., 
P event 1 leads to some number of events 2; each event 2 leads to some number of events 3, 
Sj etc., and no event can contribute to an event earlier in the chain). 
;I=J The sensitivity of the reciprocating methods is also one of the greatest weaknesses 

iji 15 when these assays are used to determine if a target nucleic acid sequence is present or 
absent in a sample. Because the product of these reactions is detectable copies of the 
Q starting material, contamination of a new reaction with the products of an earUer reaction 
H|| can lead to false positive results, (i.e., the apparent detection of the target nucleic acid in 

^ samples that do not actually contain any of that target analyte). Furthermore, because the 

lip 20 concentration of the product in each positive reaction is so high, amounts of DNA 

sufficient to create a strong false positive signal can be communicated to new reactions 
very easily either by contact with contaminated instruments or by aerosol. In contrast to 
the reciprocating methods, the most concentrated product of the sequential reaction (i.e., 
the product released in the ultimate invasive cleavage event) is not capable of initiating a 
25 like reaction or cascade if carried over to a fresh test sample. This is a marked advantage 
over the exponential amplification methods described above because the reactions of the 
present invention may be performed without the costly containment arrangements (e.g., 
either by specialized instruments or by separate laboratory space) required by any 
reciprocating reaction. While the products of a penultimate event may be inadvertently 
30 transferred to produce a background of the ultimate product in the absence of the a target 
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analyte, the contamination would need to be of much greater volume to give an 
equivalent risk of a false positive result. 

When the term sequential is used it is not intended to hmit the invention to 
configurations in which that one invasive cleavage reaction or assay must be completed 
before the initiation of a subsequent reaction for invasive cleavage of a different probe. 
Rather, the term refers to the order of events as would occur if only single copies of each 
of the ohgonucleotide species were used in an assay. The primary invasive cleavage 
reaction refers to that which occurs first, in response to the formation of the cleavage 
structure on the target nucleic acid. Subsequent reactions may be referred to as 
secondary, tertiary and so forth, and may involve artificial "target" strands that serve only 
to support assembly of a cleavage structure, and which are unrelated to the nucleic acid 
analyte of interest. While the complete assay may, if desired, be configured with each 
step of invasive cleavage separated either in space (e.g., in different reaction vessels) or 
in time (e.g., using a shift in reaction conditions, such as temperature, enzyme identity or 
solution condition, to enable the later cleavage events), it is also contemplated that all of 
the reaction components may be mixed so that secondary reactions may be initiated as 
soon as product from a primary cleavage becomes available. In such a format, primary, 
secondary and subsequent cleavage events involving different copies of the cleavage 
structures may take place simultaneously. 

Several levels of this sort of linear amplification can be envisioned, in which each 
successive round of cleavage produces an ohgonucleotide that can participate in the 
cleavage of a different probe in subsequent rounds. The primary reaction would be 
specific for the analyte of interest with secondary (and tertiary, etc.) reactions being used 
to generate signal while still being dependent on the primary reaction for initiation. 

The released product may perform in several capacities in the subsequent 
reactions. For example, the product of one invasive cleavage reaction becomes the 
INVADER oligonucleotide to direct the specific cleavage of another probe in a second 
reaction. In such an example, the first invasive cleavage structure is formed by the 
annealing of the INVADER ohgonucleotide and the probe ohgonucleotide (Probe 1) to 
the first target nucleic acid (Target 1). The target nucleic acid is divided into three 
regions based upon which portions of the INVADER and probe oligonucleotides are 
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capable of hybridizing to the target. Region 1 of the target has complementarity to only 
the INVADER oUgonucleotide; region 3 of the target has complementarity to only the 
probe; and region 2 of the target has an overlap between the INVADER and probe 
oligonucleotides. 

5 Cleavage of Probe 1 releases the "Cut Probe 1". The released Probe 1 is then 

used as the INVADER oligonucleotide in second cleavage. The second cleavage 
structure is formed by the annealing of the Cut Probe 1, a second probe oUgonucleotide 
("Probe 2") and a second target nucleic acid ("Target 2"). In some embodiments, Probe 2 
and the second target nucleic acid are covalently connected, preferably at their 3' and 5' 
10 ends, respectively, thus forming a hairpin stem and loop, termed herein a "cassette". The 
loop may be nucleic acidor a non-nucleic acid spacer or linker. Inclusion of an excess of 
Q the cassette molecule allows each Cut Probe 1 to serve as an INVADER to direct the 
IS cleavage of multiple copies of the cassette. 

;P Probe 2 may be labeled and detection of cleavage of the second cleavage structure 

ijl 1 5 may be accompUshed by detecting the labeled cut Probe 2; the label may a radioisotope 
''^^ (e.g., ^^P, ^^S), a fluorophore (e.g., fluorescein), a reactive group capable of detection by a 
p secondary agent (e.g., biotin/streptavidin), a positively charged adduct which permits 
ri detection by selective charge reversal (as discussed in Section IV above), etc. 
!^ Alternatively, the cut Probe 2 may used in a tailing reaction, or to complete or activate a 
|i| 20 protein-binding site, or may be detected or used by any of the means for detecting or 
using an oligonucleotide described herein. 

In other embodiments, probe oUgonucleotides that are cleaved in the primary 
reaction can be designed to fold back on themselves (i.e., they contain a region of self- 
complementarity) to create a molecule that can serve as both the INVADER and target 
25 oligonucleotide (termed here an "IT" complex). The IT complex then enables cleavage 
of a different probe present in the secondary reaction. Inclusion of an excess of the 
secondary probe molecule ("Probe 2"), allows each IT molecule to serve as the platform 
for the generation of multiple copies of cleaved secondary probe. The target nucleic acid 
is divided into three regions based upon which portions of the INVADER and probe 
30 oUgonucleotides are capable of hybridizing to the target (as discussed above and it is 
noted that the target may be divided into four regions if a stacker oligonucleotide is 
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employed). The second cleavage structure is formed by the annealing of the second 
probe ("Probe 2") to the fragment of Probe 1 ("Cut Probe 1") that was released by 
cleavage of the first cleavage structure. The Cut Probe 1 forms a hairpin or stem/loop 
structure near its 3' terminus by virtue of the annealing of the regions of self- 
5 complementarity contained within Cut Probe 1 (this self-annealed Cut Probe 1 forms the 
IT complex). The IT complex (Cut Probe 1) is divided into three regions. Region 1 of 
the IT complex has complementarity to the 3' portion of Probe 2; region 2 has 
complementarity to both the 3' end of Cut Probe 1 and to the 5' portion of Probe 2; and 
region 3 contains the region of self-complementarity (te.^ region 3 is complementary to 
10 the 3' portion of the Cut Probe 1). Note that with regard to the IT complex (i.e.. Cut 
Probe 1), region 1 is located upstream of region 2 and region 2 is located upstream of 

0 region 3. As for other embodiments of invasive cleavage, region "2" can represent a 

PI 

S region where there is a physical, but not sequence, overlap between the INVADER 
4^ oligonucleotide portion of the Cut Probe 1 and the Probe 2 oUgonucleotide. 

15 The cleavage products of the secondary invasive cleavage reaction (z.e., Cut Probe 

2) can either be detected, or can in turn be designed to constitute yet another integrated 
P INVADER-target complex to be used with a third probe molecule, again unrelated to the 
1^ preceding targets. 

!J! It is envisioned that the oligonucleotide product of a primary cleavage reaction 

n 20 may fill the role of any of the oligonucleotides described herein (e.g., it may serve as a 
target strand without an attached INVADER oligonucleotide-like sequence, or it may 
serve as a stacker oligonucleotide, as described above), to enhance the tumover rate seen 
in the secondary reaction by stabilizing the probe hybridization through coaxial stacking. 
Secondary cleavage reactions in some preferred embodiments of the present 
25 invention include the use of FRET cassettes. Such molecules provide both a secondary 
target and a FRET labeled cleavable sequence, allowing homogeneous detection (i.e., 
without product separation or other manipulation after the reaction) of the sequential 
invasive cleavage reaction. Other preferred embodiments use a secondary reaction 
system in which the FRET probe and synthetic target are provided as separate 
30 oligonucleotides. 
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In a preferred embodiment, each subsequent reaction is facilitated by {Le,, is 
dependent upon) the product of the previous cleavage, so that the presence of the ultimate 
product may serve as an indicator of the presence of the target analyte. However, 
cleavage in the second reaction need not be dependent upon the presence of the product 
5 of the primary cleavage reaction; the product of the primary cleavage reaction may 
merely measurably enhance the rate of the second cleavage reaction. 

In summary, the INVADER assay cascade (z.e., sequential invasive cleavage 
reactions) of the present invention is a combination of two or more linear assays that 
allows the accumulation of the ultimate product at an exponential rate, but without 
1 0 significant risk of carryover contamination. It is important to note that background that 
does not arise fi"om sequential cleavage, such as thermal breakage of the secondary probe, 
generally increases linearly with time. In contrast, signal generation from a 2-step 

'5 sequential reaction follows quadratic kinetics. Thus, collection of data as a time course, 

Ml 

4= either by taking time points or through the use of an instrument that allows real-time 
15 detection during the INVADER assay reaction incubations, provides the attractive 
capability of discriminating between the true signal and any background solely on the 

if 

P basis of quadratic versus Unear increases in signal over time. For example, when viewed 

graphically, the real signal will appear as a quadratic curve, while any accumulating 
W background will be linear, and thus easy to distinguish, even if the absolute level of the 
r| 20 background signal (e.g., fluorescence in a FRET detection format) is substantial. 

The sequential invasive cleavage amplification of the present invention can be 
used as an intermediate boost to any of the detection methods (e.g., gel based analysis by 
either standard or by charge reversal), polymerase tailing, and incorporation into a protein 
binding region, described herein. When used is such combinations the increased 
25 production of a specific cleavage product in the invasive cleavage assay reduces the 

burdens of sensitivity and specificity on the read-out systems, thus facilitating their use. 

In addition to enabling a variety of detection platforms, the cascade strategy is 
suitable for multiplex analysis of individual analytes {le., individual target nucleic acids) 
in a single reaction. The multiplex format can be categorized into two types. In one case, 
30 it is desirable to know the identity (and amount) of each of the analytes that can be 

present in a clinical sample, or the identity of each of the analytes as well as an internal 
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control. To identify the presence of multiple individual analytes in a single sample, 
several distinct secondary amplification systems may be included. Each probe cleaved in 
response to the presence of a particular target sequence (or intemal control) can be 
designed to trigger a different cascade coupled to different detectable moieties, such as 
5 different sequences to be extended by DNA polymerase or different dyes in an FRET 
format. The contribution of each specific target sequence to final product can thereby be 
taUied, allowing quantitative detection of different genes or alleles in a sample containing 
a mixture of genes or alleles. 

In the second configuration, it is desirable to determine if any of several analytes 
10 are present in a sample, but the exact identity of each does not need to be known. For 
example, in blood banking it is desirable to know if any one of a host of infectious agents 
is present in a sample of blood. Because the blood is discarded regardless of which agent 
=S{ is present, different signals on the probes would not be required in such an application of 
W the present invention, and may actually be undesirable for reasons of confidentiality. In 

15 this case, the 5' arms (i.e., the 5* portion that will be released upon cleavage) of the 
'i; different analyte-specific probes would be identical and would therefore trigger the same 
M secondary signal cascade. A similar configuration would permit multiple probes 

M 

111 complementary to a single gene to be used to boost the signal from that gene or to ensure 
U inclusivity when there are numerous alleles of a gene to be detected. 
i3 20 In the primary INVADER assay reaction, there are two potential sources of 

' background. The first is fi*om INVADER oligonucleotide-independent cleavage of probe 

annealed to the target, to itself, or to one of the other oUgonucleotides present in the 
reaction. The use of an enzyme that cannot efficiently cleave a structure that lacks a 
primer is preferred for this reason. The enzyme Pfu FEN-1 gives no detectable cleavage 
25 in the absence of the upstream oligonucleotide or even in the presence of an upstream 
oligonucleotide that fails to invade the probe-target complex. This indicates that the Pfu 
FEN-1 endonuclease is a suitable enzyme for use in the methods of the present invention. 

Other structure-specific nucleases may be suitable as a well. As discussed in the 
first example, some 5' nucleases can be used in conditions that significantly reduce this 
30 primer-independent cleavage. For exMiple, it has been shown that when the 5* nuclease 
of DNAPTaq is used to cleave hairpins the primer-independent cleavage is markedly 
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reduced by the inclusion of a monovalent salt in the reaction (Lyamichev, et al, [1993], 
supra), 

III. Effect of ARRESTOR Molecules on Signal and Background in Sequential 

Invasive Cleavage Reactions. 

As described above, the concentration of the probe that is cleaved can be used to 
increase the rate of signal accumulation, with higher concentrations of probe yielding 
higher fmal signal. However, the presence of large amounts of residual uncleaved probe 
can present problems for subsequent use of the cleaved products for detection or for 
further ampUfication. If the subsequent step is a simple detection {e.g., by gel 
resolution), the excess uncut material may cause background by streaking or scattering of 
signal, or by overwhelming a detector {e.g., over-exposing a film in the case of 
radioactivity, or exceeding the quantitative detection Umits of a fluorescence imager). 
This can be overcome by partitioning the product from the uncut probe. In more complex 
detection methods, the cleaved product may be intended to interact with another entity to 
indicate cleavage. As noted above, the cleaved product can be used in any reaction that 
makes use of oligonucleotides, such as hybridization, primer extension, Ugation, or the 
direction of invasive cleavage. In each of these cases, the fate of the residual uncut probe 
should be considered in the design of the reaction, hi a primer extension reaction, the 
uncut probe can hybridize to a template for extension. If cleavage is required to reveal 
the correct 3' end for extension, the hybridized uncut probe will not be extended. It may, 
however, compete with the cleaved product for the template. If the template is in excess 
of the combination of cleaved and uncleaved probe, then both of the latter should be able 
to find a copy of template for binding. If, however, the template is limiting, any 
competition may reduce the portion of the cleaved probe that can find successfully bind 
to the available template. If a vast excess of probe was used to drive the initial reaction, 
the remainder may also be in vast excess over the cleavage product, and thus may provide 
a very effective competitor, thereby reducing the amount of the fmal reaction (e.g., 
extension) product for ultimate detection. 

The participation of the uncut probe material in a secondary reaction can also 
contribute to background in these reactions. While the presentation of a cleaved probe 
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for a subsequent reaction may represent an ideal substrate for the enzyme to be used in 
the next step, some enzymes may also be able to act, albeit inefficiently, on the uncut 
probe as well. It was shown during the development of tiie present invention that 
transcription can be promoted from a nicked promoter even when one side of the nick has 
additional unpaired nucleotides. Similarly, when the subsequent reaction is to be an 
invasive cleavage, the uncleaved probe may bind to the elements intended to form the 
second cleavage structure with the cleaved probe, hi experiments conducted during the 
development of the preseait invention, it was found that some of the 5' nucleases 
described herein can catalyze some measure of cleavage of defective structures. Even at 
a low level, this aberrant cleavage can be misinterpreted as positive target-specific 
cleavage signal. 

With these negative effects of the surfeit of uncut probe considered, there is 
clearly a need for some meliiod of preventing these interactions. As noted above, it is 
possible to partition the cleaved product from the uncut probe after the primary reaction 
by traditional methods. However, these methods are often time consummg, may be 
expensive {e.g., disposable columns, gels, etc.), and may increase the risk for sample 
mishandling or contamination. It is far preferable to configure the sequential reactions 
such that the original sample need not be removed to a new vessel for subsequent 
reaction. 

The present invention provides a method for reducing interactions between the 
primary probe and other reactants. This method provides a means of specifically 
diverting the uncleaved probes from participation in the subsequent reactions. The 
diversion is accomplished by the inclusion in the next reaction step an agent designed to 
specifically interact with the uncleaved primary probe. While the primary probe in an 
invasive cleavage reaction is discussed for reasons of convenience, it is contemplated that 
the ARRESTOR molecules may be used at any reaction step within a chain of invasive 
cleavage steps, as needed or desired for tiie design of an assay. It is not intended that the 
ARRESTOR molecules of the present invention be limited to any particular step. 

The method of diverting the residual uncut probes from a primary reaction makes 
use of agents that can be specifically designed or selected to bind to the uncleaved probe 
molecules with greater affinity than to the cleaved probes, thereby allowing the cleaved 
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probe species to effectively compete for the elements of the subsequent reaction, even 
when the uncut probe is present in vast excess. These agents have been termed 
"ARRESTOR molecules," due to their function of stopping or arresting the primary 
probe from participation in the later reaction. In various Examples below, an 
5 oligonucleotide is provided as an ARRESTOR molecule in an invasive cleavage assay. It 
can be appreciated that any molecule or chemical that can discriminate between the full- 
length uncut probe and the cleaved probe, and that can bind or otherwise disable the 
uncleaved probe preferentially may be configured to act as ARRESTOR molecules 
within the meaning of the present invention. For example, antibodies can be derived with 
10 such specificity, as can the "aptamers" that can be selected through multiple steps of m 
vitro amphfication {e.g., "SELEX," U.S. Patent Nos. 5,270,163 and 5,567,588; herein 

p incorporated by reference) and specific rounds of capture or other selection means. 

S In one embodiment, the ARRESTOR molecule is an oligonucleotide. In another 

embodiment the ARRESTOR oUgonucleotides is a composite oligonucleotide, 

j|| 15 comprising two or more short oligonucleotides that are not covalently linked, but that 
bind cooperatively and are stabilized by co-axial stacking. In a preferred embodiment, 

[f the oligonucleotide is modified to reduce interactions with the cleavage agents of the 

Ql present invention. When an oligonucleotide is used as an ARRESTOR oligonucleotide, it 

iH 

IS is intended that it not participate in the subsequent reactive step. The binding of the 

20 ARRESTOR oligonucleotide to the primary probe may, either with the participation of 
the secondary target, or without such participation, create a bifurcated structure that is a 
substrate for cleavage by the 5' nucleases used in some embodiments of the methods of 
the present invention. Formation of such structures would lead to some level of 
unintended cleavage that could contribute to background, reduce specific signal or 
25 compete for the enzyme. It is preferable to provide ARRESTOR oligonucleotides that 
will not create such cleavage structures. One method of doing this is to add to the 
ARRESTOR oUgonucleotides such modifications as have been foimd to reduce the 
activity of INVADER oligonucleotides, as the INVADER oligonucleotides occupy a 
similar position within a cleavage structure (z.e., the 3' end of the INVADER 
30 oligonucleotide positions the site of cleavage of an unpaired 5' arm). Modification of the 
3' end of the INVADER oUgonucleotides was examined for the effects on cleavage in the 
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Example section below; a number of the modifications tested were found to be 
significantly debilitating to the function of the INVADER oligonucleotide. Other 
modifications not described herein may be easily characterized by performing such a test 
using the cleavage enzyme to be used in the reaction for which the ARRESTOR 
5 oligonucleotide is intended. 

In a preferred embodiment, the backbone of an ARRESTOR oligonucleotide is 
modified. This may be done to increase the resistance to degradation by nucleases or 
temperature, or to provide duplex structure that is a less favorable substrate for the 
enzyme to be used (e.g., A-form duplex vs. B-form duplex). In particularly preferred 
10 embodiment, the backbone-modified oligonucleotide further comprises a 3' terminal 
r=l modification. In a preferred embodiment, the modifications comprise 2' 0-methyl 
^ substitution of the nucleic acid backbone, while in a particularly preferred embodiment, 
4S the 2' 0-methyl modified oligonucleotide further comprises a 3' terminal amine group, 
rl The purpose of the ARRESTOR oUgonucleotide is to allow the minority 

w 1 5 population of cleaved probe to effectively compete with the uncleaved probe for binding 
Ip whatever elements are to be used in the next step. While an ARRESTOR oligonucleotide 
1.S that can discriminate between the two probe species absolutely (i.e., binding only to 

0 uncut and never to cut) may be of the greatest benefit in some embodiments, it is 

f I envisioned that in many applications, including the sequential INVADER assays 

20 described herein, the ARRESTOR oligonucleotide of the present invention may perform 
the intended function with only partial discrimination. When the ARRESTOR 
oUgonucleotide has some interaction with the cleaved probe, it may prevent detection of 
some portion of these cleavage products, thereby reducing the absolute level of signal 
generated fi-om a given amount of target material. If this same ARRESTOR 
25 oligonucleotide has the simultaneous effect of reducing the background of the reaction 
(i.e., from non-target specific cleavage) by a factor that is greater than the factor of 
reduction in the specific signal, then the significance of the signal (i.e., the ratio of signal 
to background), is increased, even with the lower amount of absolute signal. Any 
potential ARRESTOR molecule design may be tested in a simple fashion by comparing 
30 the levels of backgroxmd and specific signals from reactions that lack ARRESTOR 
molecules to the levels of background and specific signal from similar reactions that 
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include ARRESTOR oligonucleotides. What constitutes an acceptable level of tradeoff 
of absolute signal for specificity will vary for different applications {e.g., target levels, 
read-out sensitivity, etc.), and can be determined by any individual user using the 
methods of the present invention. In some embodiments, an ARRESTOR 
5 oHgonucleotide may be used to stop signal generation from one or more particular target 
nucleic acids (e.g., an intemal standard, or one that is present in higher abundance) so 
that incubation of the reaction mixture can be continued such that additional signal can 
accumulate from one or more other target nucleic acids in the same reaction vessel. 

10 IV. Improved Enzymes For Use In INVADER Oligonucleotide-Directed 
f% Cleavage Reactions Comprising RNA Targets; 

m 

:|« A cleavage structure is defined herein as a structure that is formed by the 

iZ interaction of a probe ohgonucleotide and a target nucleic acid to form a duplex, the 

'0 1 5 resulting structure being cleavable by a cleavage agent, including but not limited to an 

'Q enzyme. The cleavage structure is fixrther defined as a substrate for specific cleavage by 

;=| the cleavage means in contrast to a nucleic acid molecule that is a substrate for 

P nonspecific cleavage by agents such as phosphodiesterases, hi considering 

PI improvements to enzymatic cleavage agents, one may consider the action of said 

20 enzymes on any structures that fall within the definition of a cleavage structure. Specific 

cleavage at any site within such a structure is contemplated. 

hnprovements in an enzyme may be an increased or decreased rate of cleavage of 

one or more types of structures, hnprovements may also result in more or fewer sites of 

cleavage on one or more of said cleavage structures. In developing a library of new 
25 structure-specific nucleases for use in nucleic acid cleavage assays, improvements may 

have many different embodiments, each related to the specific substrate structure used in 

a particular assay. 

As an example, one embodiment of the INVADER oligonucleotide-directed 
cleavage assay of the present invention may be considered. In the INVADER 
30 oUgonucleotide-directed cleavage assay, the accumulation of cleaved material is 

influenced by several features of the enzyme behavior. Not surprisingly, the turnover 
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rate, or the number of structures that can be cleaved by a single enzyme molecule in a set 
amount of time, is very important in determining the amount of material processed during 
the course of an assay reaction. If an enzyme takes a long time to recognize a substrate 
{e.g., if it is presented with a less-than-optimal structure), or if it takes a long time to 
execute cleavage, the rate of product accumulation is lower than if these steps proceeded 
quickly. If these steps are quick, yet the enzyme "holds on" to the cleaved structure, and 
does not immediately proceed to another uncut structure, the rate will be negatively 
affected. 

Enzyme tumover is not the only way in which enzyme behavior can negatively 
affect the rate of accumulation of product. When the means used to visualize or measure 
product is specific for a precisely defined product, products that deviate from that 
definition may escape detection, and thus the rate of product accumulation may appear to 
be lower than it is. For example, if one had a sensitive detector for trinucleotides that 
could not see di- or tetranucleotides, or any sized oligonucleotide other that 3 residues, in 
the INVADER-directed cleavage assay of the present invention any errant cleavage 
would reduce the detectable signal proportionally. It can be seen from the cleavage data 
presented here that, while there is usually one site within a probe that is favored for 
cleavage, there are often products that arise from cleavage one or more nucleotides away 
from the primary cleavage site. These are products that are target-dependent, and are 
thus not non-specific background. Nevertheless, if a subsequent visualization system can 
detect only the primary product, these represent a loss of signal. One example of such a 
selective visualization system is the charge reversal readout presented herein, in which 
the balance of positive and negative charges determines the behavior of the products. In 
such a system the presence of an extra nucleotide or the absence of an expected 
nucleotide can exclude a legitimate cleavage product from ultimate detection by leaving 
that product with the wrong balance of charge. It can be easily seen that any assay that 
can sensitively distinguish the nucleotide content of aa ohgonucleotide, such as standard 
stringent hybridization, suffers in sensitivity when some fraction of the legitimate product 
is not eligible for successful detection by that assay. 

These discussions suggest two highly desirable traits in any enzyme to be used in 
the method of the present invention. First, the more rapidly the enzyme executes an 
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entire cleavage reaction, including recognition, cleavage and release, the more signal it 
may potentially created in the INVADER oligonucleotide-directed cleavage assay. 
Second, the more successful an enzyme is at focusing on a single cleavage site within a 
structure, the more of the cleavage product can be successfully detected in a selective 
5 read-out. 

The rationale cited above for making improvements in enzymes to be used in the 
INVADER oligonucleotide-directed cleavage assay are meant to serve as an example of 
one direction in which improvements might be sought, but not as a limit on either the 
nature or the applications of improved enzyme activities. As another direction of activity 
10 change that would be appropriately considered improvement, the DNAP-associated 5* 
nucleases may be used as an example. In creating some of the polymerase-deficient 5' 
% nucleases described herein it was found that the those that were created by deletion of 
I substantial portions of the polymerase domain, assumed activities that were weak or 

absent in the parent proteins. These activities included the ability to cleave non-forked 

\^ 

.^15 structures, a greatly enhanced ability to exonucleolytically remove nucleotides jfrom the 
5' ends of duplexed strands, and a nascent ability to cleave circular molecules without 
benefit of a free 5' end. 

In addition to the 5' nucleases derived from DNA polymerases, the present 
invention also contemplates the use of structure-specific nucleases that are not derived 
20 from DNA polymerases. For example, a class of eukaryotic and archaebacterial 
endonucleases have been identified which have a similar substrate specificity to 5' 
nucleases of Pol I-type DNA polymerases. These are the FENl (Flap EndoNuclease), 
RAD2, and XPG (Xeroderma Pigmentosa-complementation group G) proteins. These 
proteins are involved in DNA repair, and have been shown to favor the cleavage of 
25 structures that resemble a 5' arm that has been displaced by an extending primer during 
polymerization. Similar DNA repair enzymes have been isolated from single cell and 
higher eukaryotes and from archaea, and there are related DNA repair proteins in 
eubacteria. Similar 5' nucleases have also been associated with bacteriophage such as T5 
andT7. 

30 Recently, the 3-dimensional structures of DNAPTaq and T5 phage 5'-exonuclease 

were determined by X-ray diffraction (Kim et al. Nature 376:612 [1995]; and Ceska et 
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ah. Nature 382:90 [1995]). The two enzymes have very similar 3 -dimensional structures 
despite limited amino acid sequence similarity. The most striking feature of the T5 
5'-exonuclease structure is the existence of a triangular hole formed by the active site of 
the protein and two alpha helices. This same region of DNAPTaq is disordered in the 
crystal structure, indicating that this region is flexible, and thus is not shown in the 
published 3-dimensional structure. However, the 5' nuclease domain of DNAPTaq is 
likely to have the same structure, based its overall 3-dimensional similarity to T5 
S'-exonuclease, and that the amino acids in the disordered region of the DNAPTaq 
protein ^e those associated with alpha helix formation. The existence of such a hole or 
groove in the 5' nuclease domain of DNAPTaq was predicted based on its substrate 
specificity (Lyamichev et al, supra). 

It has been suggested that the 5' arm of a cleavage structure must thread through 
the helical arch described above to position said structure correctly for cleavage (Ceska et 
aL, supra). One of the modifications of 5* nucleases described herein opened up the 
helical arch portion of the protein to allow improved cleavage of structures that cut 
poorly or not at all (e.g., structures on circular DNA targets that would preclude such 
threading of a 5' arm). The gene construct that was chosen as a model to test this 
approach was the one called CLEAVASE BN, which was derived from DNAPTaq but 
does not contain the polymerase domain. It comprises the entire 5' nuclease domain of 
DNAP Taq, and thus should be very close in structure to the T5 5' exonuclease. This 5' 
nuclease was chosen to demonstrate the principle of such a physical modification on 
proteins of this type. The arch-opening modification of the present invention is not 
intended to be limited to the 5' nuclease domains of DNA polymerases, and is 
contemplated for use on any structure-specific nuclease that includes such an aperture as 
a limitation on cleavage activity. The present invention contemplates the insertion of a 
thrombin cleavage site into the helical arch of DNAPs derived from the genus Thermus as 
well as 5' nucleases derived from DNAPs derived from the genus Thermus. The specific 
example shown herein using the CLEAVASE BN/thrombin nuclease merely illustrates 
the concept of opening the hehcal arch located within a nuclease domain. As the amino 
acid sequence of DNAPs derived from the genus Thermus are highly conserved, the 
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teachings of the present invention enable the insertion of a thrombin site into the helical 
arch present in these DNAPs and 5' nucleases derived from these DNAPs. 

The opening of the helical arch was accomplished by insertion of a protease site 
in the arch. This allowed post-translational digestion of the expressed protein with the 
5 appropriate protease to open the arch at its apex. Proteases of this type recognize short 
stretches of specific amino acid sequence. Such proteases include thrombin and factor 
Xa. Cleavage of a protein with such a protease depends on both the presence of that site 
in the amino acid sequence of the protein and the accessibility of that site on the folded 
intact protein. Even with a crystal structure it can be difficult to predict the susceptibility 
10 of any particular region of a protein to protease cleavage. Absent a crystal structure it 
must be determined empirically. 

PI 

Q In selecting a protease for a site-specific cleavage of a protein that has been 

f modified to contain a protease cleavage site, a first step is to test the unmodified protein 
m for cleavage at altemative sites. For example, DNAPTaq and CLEAVASE BN nuclease 
i|l 1 5 were both incubated under protease cleavage conditions with factor Xa and thrombin 

proteases. Both nuclease proteins were cut with factor Xa within the 5' nuclease domain, 

fl but neither nuclease was digested with large amounts of thrombin. Thus, thrombin was 

II 

s4 chosen for initial tests on opening the arch of the CLEAVASE BN enzyme. 

y In the protease/ CLEAVASE modifications described herein the factor Xa 

20 protease cleaved strongly in an unacceptable position in the unmodified nuclease protein, 
in a region likely to compromise the activity of the end product. Other unmodified 
nucleases contemplated herein may not be sensitive to the factor Xa, but may be sensitive 
to thrombin or other such proteases. Altematively, they may be sensitive to these or 
other such proteases at sites that are immaterial to the function of the nuclease sought to 
25 be modified. In approaching any protein for modification by addition of a protease 
cleavage site, the unmodified protein should be tested with the proteases under 
consideration to determine which proteases give acceptable levels of cleavage in other 
regions. 

Working with the cloned segment of DNAPTaq from which the CLEAVASE BN 
30 protein is expressed, nucleotides encoding a thrombin cleavage site were introduced 

in-frame near the sequence encoding amino acid 90 of the nuclease gene. This position 



123 



was determined to be at or near the apex of the hehcal arch by reference to both the 
3-dimensional structure of DNAPTaq, and the structure of T5 5' exonuclease. The 
encoded amino acid sequence, LVPRGS, was inserted into the apex of the heUcal arch by 
site-directed mutagenesis of the nuclease gene. The proUne (P) in the thrombin cleavage 
site was positioned to replace a proline normally in this position in CLEAVASE BN 
because proline is an alpha helix-breaking amino acid, and may be important for the 
3-dimensional structure of this arch. This construct was expressed, purified and then 
digested with thrombin. The digested enzyme was tested for its abihty to cleave a target 
nucleic acid, bacteriophage M13 genomic DNA, that does not provide free 5' ends to 
facilitate cleavage by the threading model. 

While the helical arch in this nuclease was opened by protease cleavage, it is 
contemplated that a number of other techniques could be used to achieve the same end. 
For example, the nucleotide sequence could be rearranged such that, upon expression, the 
resulting protein would be configured so that the top of the hehcal arch (amino acid 90) 
would be at the amino terminus of the protein, the natural carboxyl and amino termini of 
the protein sequence would be joined, and the new carboxyl terminus would he at natural 
amino acid 89. This approach has the benefit that no foreign sequences are introduced 
and the enzyme is a single amino acid chain, and thus may be more stable that the 
cleaved 5' nuclease. Li the crystal structure of DNAPTaq, the amino and carboxyl 
termini of the 5'-exonuclease domain he in close proximity to each other, which suggests 
that the ends may be directly joined without the use of a flexible linker peptide sequence 
as is sometimes necessary. Such a rearrangement of the gene, with subsequent cloning 
and expression could be accomphshed by standard PGR recombination and cloning 
techniques known to those skilled in the art. 

The INVADER invasive cleavage reaction has been shown to be useful in the 
detection of RNA target strands {See e.g., U.S. Patent 6,001,567, incorporated herein by 
reference in its entirety). As with the INVADER assay for the detection of DNA 
(Lyamichev et al, Nat. BiotechnoL, 17:292 [1999]), the reactions may be run under 
conditions that permit the cleavage of many copies of a probe for each copy of the target 
RNA present in the reaction. In one embodiment, the reaction may be performed at a 
temperature close to the melting temperature (Tm) of the probe that is cleaved, such that 
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the cleaved and uncleaved probes readily cycle on and off the target straad without 
temperature cycling. Each time a full-length probe binds to the target in the presence of 
the INVADER oligonucleotide, it maybe cleaved by a 5* nuclease enzyme, resulting in 
an accumulation of cleavage product. The accumulation is highly specific for the 
sequence being detected, and may be configured to be proportional to both time and 
target concentration of the reaction, hi another embodiment, the temperature of the 
reaction may be shifted (Le,, it may be raised to a temperature that will cause the probe to 
dissociate) then lowered to a temperature at which a new copy of the probe hybridizes to 
the target and is cleaved by the enzyme, hi a further embodiment, the process of raising 
and lowering the temperature is repeated many times, or cycled, as it is in PGR (MuUis 
and Faloona, Methods in Enzymology, 155:335 [1987], Saiki et al. Science 230:1350 
[1985]). 

As noted above, 5' nucleases of Pol A type DNA polymerases are preferred for 
cleavage of an invasive cleavage structure that comprises an RNA target strand. The 
present invention provides enzymes having improved performance in detection assays 
based on the cleavage of a structure comprising RNA. Li particular, the altered 
polymerases of the present invention exhibit improved performance in detection assays 
based on the cleavage of a DNA member of an invasive cleavage structure that comprises 
an RNA target strand. 

The 5' nucleases of the present invention may be derived from Pol A type DNA 
polymerases. The terminology used in describing the alterations made in this class of 5' 
nucleases relates to the descriptions of DNA polymerase structures known in the art. The 
Klenow fragment of the Pol A polymerase jfrom E. coli (the C-terminal two thirds, which 
has the DNA synthesizing activity but lacks the 5' nuclease activity) has been described 
as having a physical form resembling a right hand, having an open region called the 
"palm", and a cleft that holds the primer/template duplex defined on one side by a 
"fingers" domain and on the other by a "thumb" domain (Joyce and Steitz, Trends in 
Biochemical Science 12:288 [1987]). This is shown schematically in Figure 5. Because 
this physical form has proved to be common to all Pol A DNA polymerases and to a 
number of additional template-dependent polymerizing enzymes such as reverse 
transcriptases, the hand terminology has become known in the art, and the sites of activity 
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in these enzymes are often described by reference to their position on the hand. For 
reference, and not intended as a limitation on the present invention, the palm is created 
from roughly the first 200 amino acids of the polymerase domain, the thumb from the 
middle 140, and the fingers by the next 160, with the base of the cleft formed from the 
5 remaining regions (Figures 6). Although some enzymes may deviate from these 
structural descriptions, the equivalent domains and sequences corresponding to such 
domains may be identified by sequence homology to known enzyme sequences, by 
comparison of enzyme crystal structures, and other like methods. 

In creating the improved enzymes of the present invention, several approaches 
^ 1 0 have been taken, although the present invention in not limited to any particular approach. 
First two DNA polymerases, Taq and Tth, that have different rates of DNA strand 
cleavage activity on RNA targets were compared. To identify domains related to the 
H?! differences in activity, a series of chimerical constructs was created and the activities 
fli were measured. This process identified two regions of the Tth polymerase that could, if 
^ 15 transferred into the Taq polymerase, confer on the TaqPol an RNA-dependent cleavage 
'J activity equivalent to that of the native Tth protein. Once these regions were identified, 

|i| the particular amino acids involved in the activity were examined. Since the two proteins 
J.J are about 87 percent identical in amino acid sequence overall, the identified regions had 

Q only a small number of amino acid differences. By altering these amino acids singly and 

20 in combinations, a pair of amino acids were identified in TthPol that, if introduced into 
the TaqPol protein, increased the rate of cleavage up to that of the native TthPol. 

These data demonstrate two important aspects of the present invention. First, 
specific amino acids can be changed to confer TthPol-Uke RNA-dependent cleavage 
activity on a polymerase having a lesser activity. More broadly, however, these results 
25 provide regions of these polymerases that are involved in the recognition of the 

RNA-containing cleavage structure. Identification of these important regions, combined 
with published information on the relationships of other amino acids to the various 
fimctions of these DNA polymerases and computer-assisted molecular modeling during 
the development of the present invention have allowed a rational design approach to 
30 create additional improved 5' nucleases. The information also allowed a focused random 
mutagenesis approach coupled with a rapid screening procedure to quickly create and 
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identify enzymes having improved properties. Using these methods of the present 
invention, a wide array of improved polymerases are provided. 

The methods used in creating and selecting the improved 5' nucleases of the 
present invention are described in detail below and in the experimental examples. A 
5 general procedure for screening and characterizing the cleavage activity of any 5* 

nuclease is included in the experimental examples. The methods discussions are divided 
into the following sections: I) Creation and selection of chimerical constructs; II) 
Site-specific mutagenesis based on information jfrom chimerical constructs; HI) 
Site-specific mutagenesis based on molecular modeling and published physical studies; 
10 and IV) focused random mutagenesis. Additional chimerical constructs not included in 
^ this discussion are described in the Experimental Examples, 

1) Creation and selection of chimerical constructs 
O The PolA-type DNA polymerases, including but not limited to DNA polymerase 

15 enzymes from Thermus species, comprise two distinctive domains, the 5' nuclease and 
;U the polymerase domains, shown schematically in Figure 6. The polymerase domains 
ni reside in the C-terminal two-thirds of the proteins and are responsible for both 

ii| DNA-dependent and RNA-dependent DNA polymerase activities. The N-terminal 
fc! one-third portions contain the 5' nuclease domains. In the genus Thermus Pol A 

20 polymerase, the palm region consists of, roughly, amino acids 300-500, the thumb region 
includes amino acids 500-650, while the fingers region is formed by the remaining amino 
acids from 650 to 830 (Figure 6). 

The derivatives, Taq DN RX HT and Tth DN RX HT, of Taq and TthPol used in 
many of the experiments of the present invention, and described herein, are modified to 
25 reduce synthetic activity and to facilitate chimera construction, but have 5' nuclease 
activity essentially identical to unmodified TaqPol and TthPol. Unless otherwise 
specified, the TaqPol and TthPol enzymes of the following discussion refer to the DN RX 
HT derivative. 

TthPol has a 4-fold higher cleavage rate with the IL-6 RNA template (shown in 
30 Figure 10) than TaqPol (shown in Figures 1 1 and 12), although the Taq and TthPols show 
similarities of cleavage in DNA target structures (Figure 10). Since the amino acid 
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sequences of TaqPol and TthPol (Figures 8 and 9) share about 87% identity and greater 
than 92% similarity, the high degree of homology between the enzymes allowed creation 
of a series of chimeric enzymes between TthPol ^d TaqPol. The activity of the chimeric 
enzymes was used as a parameter to identify the region(s) of these proteins affecting 
RNA dependent 5' nuclease activity. 

The chimeric constructs between TthPol and TaqPol genes shown schematically 
in Figures 7 and 19 were created by swapping DNA fragments defined by the restriction 
endonuclease sites, EcoRI and BamHI, common for both genes, the cloning vector site 
Sail and the new sites, NotI, BstBI and Ndel, created at the homologous positions of both 
genes by site directed mutagenesis. The restriction enzymes have been abbreviated as 
follows: EcoRI is E; NotI is N; BstBI is Bs; Ndel is D, BamHI is B, and Sail is S. 

The activity of each chimeric enzyme was evaluated using the invasive signal 
amplification assay with the IL-6 RNA target (Figure 10), and the cycling cleavage rates 
shown in Figure 12 were determined as described in the Experimental Examples. 
Comparison of the cleavage rates of the first two chimeras, TaqTth(N) and TthTaq(N), 
created by swapping the polymerase and 5' nuclease domains at the NotI site (Figure 7), 
shows that TaqTth(N) has the same activity as TthPol, whereas its counterpart TthTaq(N) 
retains the activity of TaqPol (Figure 12). This result indicates that the higher cleavage 
rate of TthPol is associated with its polymerase domain and suggests an important role of 
the polymerase domain in the 5' nuclease activity. 

The next step was to identify a minimal region of TthPol polymerase that would 
give rise to the TthPol-Uke RNA dependent 5' nuclease activity when substituted for the 
corresponding region of the TaqPol sequence. To this end, the TaqTth(N) chimera was 
selected to generate a series of new constructs by replacing its TthPol sequence with 
homologous regions of TaqPol. First, the N-terminal and C-terminal parts of the TaqPol 
polymerase domain were substituted for the corresponding regions of TaqTth(N) using 
the common BamHI site as a breaking point to create TaqTth(N-B) and TaqTth(B-S) 
chimeras, respectively (Figure 7). TaqTth(N-B) which has the TthPol sequence between 
amino acids 328 and 593, is approximately 3 times more active than the TaqTth(B-S) and 
40% more active than TthPol (Figure 12). This result estabhshes that the Notl-BamHI 
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portion of the TthPol polymerase domain determines superior RNA-dependent 5' 
nuclease activity of TthPol. 

From these data it was determined that a central portion of the TthPol, when used 
to replace the homologous portion of TaqPol (TaqTth(N-B) construct) could confer 
superior RNA recognition on the chimerical protein composed primarily of Taq protein. 
In fact, the cycHng rate of this chimerical protein exceeded that of the parent TthPoL 
Comparison of chimeras that included sub-portions of the activity-improving region of 
TthPol, approximately 50% of the region in each case {See, TaqTth(N-D) and 
TaqTth(D-B), Figures 7 and 12) showed no significant improvement in RNA dependent 
activity as compared to the parent TaqPol. This result indicates that aspects of each half 
of the region are required for this activity. A construct having an only sUghtly smaller 
portion of the Tth insert portion (TaqTth(Bs-B)) showed activity that was close to that of 
the parent TthPol protein, but which was less than that of the TaqTth(N-B) construct. 

2) Site-specific mutagenesis based on information from chimerical 
constructs 

Comparison of the TthPol and TaqPol amino acid sequences between the BstBI 
and BamHI sites reveals only 25 differences (Figure 13). Among those, there are 12 
conservative changes and 13 substitutions resulting in a change in charge. Since the 
analysis of the chimeric enzymes has suggested that some critical amino acid changes are 
located in both BstBI-Ndel and Ndel-BamHI regions of TthPol, site directed 
mutagenesis was used to introduce the TthPol specific amino acids into the BstBI-Ndel 
and Ndel-BamHI regions of the TaqTth(D-B) and TaqTth (N-D) chimeras, respectively. 
Six TthPol-specific substitutions were generated in the BstBI-Ndel region of the 
TaqTth(D-B) by single or double amino acid mutagenesis and only one double mutation, 
W417L/G418K, was able to restore the TthPol activity with the IL-6 RNA target {See 
e.g.. Figure 14). Similarly, 12 TthPol specific amino acids were introduced at the 
homologous positions of the Ndel-BamHI region of the TaqTth(N-D) and only one of 
them, E507Q, increased the cleavage rate to the TthPol level {See e.g.. Figure 14). 

To confirm that the W417L, G418K and E507Q substitutions are sufficient to 
increase the TaqPol activity to the TthPol level, TaqPol variants carrying these mutations 
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were created and their cleavage rates with the IL-6 RNA substrate were compared with 
that of TthPoL Figure 15 shows that the TaqPol W417L/G418K/E507Q and TaqPol 
G4] 8K/E507Q mutants have L4 times higher activity than TthPol and more than 4 fold 
higher activity than TaqPol, whereas the TaqPol W417L/E507Q mutant has the same 
5 activity as TthPol, which is about 3 fold higher than TaqPol. These results demonstrate 
that K418 and Q507 of TthPol are important amino acids in defining its superior RNA 
dependent 5' nuclease activity compared to TaqPol 

The ability of these amino acids to improve the RNA dependent 5' nuclease 
activity of a DNA polymerase was tested by introducing the corresponding mutations into 

10 the polymerase A genes of two additional organisms: Thermus filiformus and Thermus 
scotoductus. TaqPol showed improved RNA dependent activity when it was modified to 
contain the W417L and E507Q mutations, which made it more similar at these residues 
to the corresponding residues of TthPol (K418 and Q507). The TfiPol was modified to 
have P420K and E507Q, creating TfiDN 2M, while the TscPol was modified to have 

15 E41 6K and E505Q, to create TscDN 2M. The activity of these enzymes for cleaving 

various DNA and RNA containing structures was determined as described in Example 1, 
using the IdT2, IrT3, hairpin and X-structures diagrammed in Figures 21 and 22, with the 
results shown in both Figure 25 and Table 8. Both enzymes have much less 
RNA-dependent cleavage activity than either the TthPol or the Taq 2M enzymes. 

20 However, introduction of the mutations cited above into these polymerases increased the 
RNA dependent cleavage activity over 2 fold compared to the unmodified enzymes 
(Figure 25). These results demonstrate that transferability of improved RNA dependent 
cleavage activity into a wide range of polymerases using the methods of the present 
invention. 

25 

3) Site-specific mutagenesis based on molecular modeling and published 
physical studies 

The positions of the G418H and E507Q mutations in the crystal structure of a 
complex of the large firagment of TaqPol (Klentaql) with a primer/template DNA 
30 determined by Li etal (Li etal. Protein Sci., 7:1116 [1998]) are shown in Figure 17. 

The E507Q mutation is located at the tip of the thumb subdomain at a nearest distance of 
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3.8 A and 18 A from the backbone phosphates of the primer and template strands, 
respectively. The interaction between the thumb and the minor groove of the DNA 
primer/template was previously suggested by the co-crystal structures of Klenow 
fragment DNA polymerase I (Breese et al, Science 260:352 [1993]) and TaqPol (Eom et 
ah, Nature 382:278 [1996]). Deletion of a 24 amino acid portion of the tip of the thumb 
in Klenow fragment, corresponding to amino acids 494-518 of TaqPol, reduces the DNA 
binding affinity by more than 100-fold (Minnick et al, J. Biol. Chem., 271 :24954 
[1996]). These observations are consistent with the hypothesis that the thumb region, 
which includes the E507 residue, is involved in interactions with the upstream substrate 
duplex. 

The W417L and the G418K mutations in the palm region of TaqPol (Figure 17) 
are located approximately 25 A from the nearest phosphates of the template and upstream 
strands, according to the co-crystal structures of TaqPol with duplex DNA bound in the 
polymerizing mode (Li et al. Protein Sci., 7:1 1 16 [1998], Eom et al. Nature 382:278 
[1996]). The same distance was observed between the analogous W513 and P514 amino 
acids of Klenow fragment and the template strand of DNA bound in the editing mode 
(Breese et al. Science 260:352 [1993]). Thus, no interactions between TaqPol and the 
overlapping substrate can be suggested from the available co-crystal studies for this 
region. 

Although an understanding of the mechanism of action of the enzymes is not 
necessary for the practice of the present invention and the present invention is not limited 
to any mechanism of action, it is proposed that the amino acids at positions 417 and 418 
in the palm region of TaqPol interact with the upstream substrate duplex only when the 
enzyme functions as a 5' nuclease, but no interaction with these amino acids occurs when 
TaqPol switches into polymerizing mode. This hypothesis suggests a novel mode of 
substrate binding by DNA polymerases called here the "5' nuclease mode." Several lines 
of evidence support this hypothesis. The study of the chimeric enzymes described here 
clearly separates regions of the polymerase domain involved in the 5' nuclease and 
polymerase activities. Accordingly, the W417L and G418K mutations, together with the 
E507Q mutation, affect the 5' nuclease activity of TaqPol on substrates having an RNA 
target strand (Figure 15), but have no effect on either RNA-dependent or DNA-dependent 
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DNA polymerase activities (Figure 16). On the other hand, mutations in the active site of 
TaqPol, such as R573A, R587A, E615A, R746A, N750A and D785N, which correspond 
to substitutions in Klenow fragment of E.coH DNA Pol I that affect both polymerase 
activity and substrate binding affinity in the polymerizing mode (Polesky et al, J. Biol. 
Chem., 265:14579 [1990], Polesky et aL, J. Biol, Chem., 267:8417 [1992], Pandey et al, 
Eur. J. Biochem., 214:59 [1993]) were shown to have little or no effect on the 5' nuclease 
activity. Superposition of the polymerase domains of TaqPol (Eom et al. Nature 
382:278 [1996]), E.coH Pol I and Bacillus stearothermophilus Pol I (Kiefer et al. Nature 
391:304 [1998]) using the programs DALI (Hohn and Sander, J. MoL Biol., 233:123 
[1993], Hohn and Sander, Science 273:595 [1996]) and Insight H (Molecular Simulation 
Mc, Naperville, IL) shows that the palm region of TaqPol between amino acids 402-451, 
including W417 and G418, is structurally highly conserved between the three 
polymerases, although there is no structural similarity between the rest of the palm 
subdomains. This observation suggests an important role for this region in eubacterial 
DNA polymerases. 

The 5' nuclease and polymerase activities should be precisely synchronized to 
create a nicked structure rather than a gap or an overhang that could cause a deletion or 
an insertion during Okazaki fragment processing or DNA repair, if ligase joins the ends 
inappropriately. According to the previously proposed model (Kaiser et al , J. BioL 
Chem., 274:21387 [1999]), the 3' terminal nucleotide of the upstream strand is 
sequestered by the 5' nuclease domain to prevent its extension, thus halting synthesis. 
The interaction with the 3' nucleotide apparently activates the 5' nuclease that 
endonucleolitically removes the displaced 5' arm of the downstream strand. This 
cleavage occurs by the precise incision at the site defined by the 3' nucleotide, thus 
creating the nick. This model requires a substantial rearrangement of the 
substrate-enzyme complex, which may include a translocation of the complex to the 5' 
nuclease mode to separate the primer/template from the polymerase active site. 

It is possible that a relocation of the substrate away from the polymerase active 
site could be induced by the interaction between the duplex formed between the template 
and incoming strands and the crevice formed by the finger and thumb subdomains. Such 
an interaction could force conformational transitions in the thxmib that would bring the 
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template/primer duplex into close contact with the W417 and G418 amino acids. 
Significant flexibility of the thumb has been previously reported that might explain such 
changes (Beese et al. Science 260:352 [1993], Eom et aL, Nature 382:278 [1996], OUis 
et aL, Nature 313:762 [1985], Kim et aL, Nature 376:612 [1995], Korolev et aL, Proc. 

5 Natl. Acad. Sci., 92:9264 [1995], Li et aL, EMBO J., 17:7514 [1998]). Additional 

conformational changes of the fingers domain that might help to open the crevice, such as 
the transition from the 'closed* to the 'open* structure described by Li et aL (Li et aL, 
EMBO J., 17:7514 [1998]), are consistent with this model. It may be that the 5* nuclease 
binding mode was not observed in any of the published co-crystal structures of a DNA 

10 Pol I because the majority of the structures were solved for the polymerase domain only, 
with a template/primer substrate rather than with an overlapping 5' nuclease substrate. 

Km values of 200-300 nM have been determined for TaqPol, TthPol and TaqPol 
G418K/E507Q for the RNA containing substrate. These values are much higher than the 
Km value of <1 nM estimated for TthPol with an all-DNA overlapping substrate 

15 suggesting that the RNA template adversely affects substrate binding. The low affinity 
could be explained by the unfavorable interaction between the enzyme and either the 
A-form duplex adopted by the substrate with an RNA target, or the ribose T hydroxyls of 
the RNA strand. Between these two factors, the latter seems more Hkely, since the 5' 
nucleases of eubacterial DNA polymerases can efficiently cleave substrates with an RNA 

20 downstream probe (Lyamichev et aL, Science 260:778 [1993]), which would presumably 
have an A-form. Further, the co-crystal studies suggest that the template/primer duplex 
partially adopts a conformation close to A-form in its complex with DNA polymerase 
(Eom et aL, Nature 382:278 [1996], Kiefer et aL, Nature 391:304 [1998], Li et aL, 
EMBO J., 17:7514 [1998]). The G418K/E507Q mutations increase the kcat of TaqPol 

25 more than two fold, but have httle effect on K^. Such an effect would be expected if the 
mutations position the substrate in an orientation more appropriate for cleavage rather 
than simply increasing the binding constant. 

In addition to the mutational analysis described above, another approach to 
studying specific regions of enzymes, enzyme structure-function relationships, and 

30 enzyme-substrate interaction is to investigate the actual, physical structure of the 
molecule. 
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With the advances in crystallographic, NMR, and computer and software 
technology, study of molecular structure has become a viable tool for those interested in 
the configuration, organization, and dynamics of biomolecules. Molecular modeling has 
increased the understanding of the nature of the interactions that underiie the structure of 
5 proteins and how proteins interact functionally with substrate. Numerous pubhcations 
describing the structures of various polymerases or polymerase protein portions, HIV 
reverse transcriptase, and other nucleic acid binding proteins have provided mechanistic 
insights into protein conformation, changes in conformation, and molecular interactions 
necessary for fimction. 
10 As an example, the report by Doublie et al (Doublie etal^ Nature 391:251 

H [1998]) discloses the crystal structure of T7 DNA polymerase and provides information 
p about which amino acid regions are likely to have an affect on substrate binding, which 
^ are required to contact the substrate for polymerization, and which amino acids bind 
Bl cofactors, such as metal ions. It is noted in this paper and others that many of the 
\Q 15 polymerases share not only sequence similarity, but structural homology as well. When 
% certain structural domains of different polymerases are superimposed (for example, T7 

111 polymerase, Klenow fragment editing complex, the unUganded Taq DNA polymerase 

w 

and the Taq Polymerase-DNA complex) conserved motifs are clearly discemable, 
y Specifically, combining the information from all of these different structural 

20 sources and references, a model of the protein interacting with DNA, RNA, or 

heteroduplex can be made. The model can then be examined to identify amino acids that 
may be involved in substrate recognition or substrate contact. Changes in amino acids 
can be made based on these observations, and the effects on the various activities of the 5' 
nuclease proteins are assessed using screening methods such as those of the present 

25 invention, described in the experimental examples. 

The domain swapping analysis discussed previously demonstrated that sequences 
of TthDN that are important in RNA-dependent 5' nuclease activity he in the polymerase 
domain of the protein. Therefore, study of structural data of the polymerase domain with 
respect to nucleic acid recognition provides one method of locating amino acids that, 

30 when altered, alter RNA recognition in a 5' nuclease reaction. For example, analysis 

conducted during the development of the present invention examined published analyses 

134 



relating to primer/template binding by the polymerase domain ofE. coli Pol 1, the 
Klenow fragment. Table 2 shows a sampling of kinetic constants determined for the 
Klenow fragment, and shows the effects a number of mutations on these measurements. 
The corresponding or similarly positioned amino acids in the TaqPol are indicated in the 
5 right hand column. It was postulated that mutations having a noticible impact on the 
interactions of the Klenow fragment with the DNA template or the primer/template 
duplex, as indicated by changes in Kd and Relative DNA affinity values, might also have 
effects when made at the corresponding sites in TaqPol and related chimerical or mutant 
% derivatives. A selection of the mutations that produced a higher Kd value or a lower 
10 Relative DNA affinity value when introduced into the Klenow fragment were created and 
examined in TaqPol. These Taq derivatives include, but are not Umited to, those 

tifZi. 

F| indicated by asterisks in the right hand column of Table 2. 

^ For some Klenow variants, such as the R682 mutants, selection for testing was 

Hi not made based on the DNA affinity measurements, but because molecular modeling 

1 si 

2 15 suggested interaction between some aspect of the template/primer duplex and that amino 

acid. Similarly, additional regions of Taq polymerase (or Taq derivatives) were targeted 
f% for mutagenesis based on structural data and information from molecular modeUng. 
l^S Based on modeling, the thumb region was postulated to contact an RNA template. Thus, 

!3 amino acids in the thumb region were looked for that, if altered, might alter that contact. 
20 For example. Figures 6 and 17 show that amino acids 502, 504, and 507 are located at the 
tip of the thumb. It was postulated that altering these amino acids might have an affect 
on the enzyme-substrate interaction. Using the activity screening methods described hi 
Example 1, mutations that produced beneficial effects were identified. This approach 
was used to create a number of improved enzymes. For example, TaqPol position H784, 
25 corresponding to Klenow amino acid H881, is an amino acid in the fingers region and, as 
such, may be involved in primer/template substrate binding. When the H881 amino acid 
in the Klenow enzyme is replaced by alanine, the change decreases the affinity of the 
enzyme for DNA to only 30 to 40% of the wild type level. An analogous substitution 
was tested in a TaqPol-derived enzyme. Starting with the Taq derivative 
30 W417L/G418K/E507Q, amino acid 784 was changed from Histidine (H) to Alanine (A) 
to yield the W417L/G418K/E507Q/H784A mutant, temied Taq 4M. This variant showed 
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improved 5' nuclease activity on the RNA test IrTl (Figure 24) test substrate (data in 
Table 3). Amino acid R587 is in the thumb region, and was selected for mutation based 
on its close proximity to the primer/template duplex in computer models. When an 
R587A mutation was added to the Taq 4M variant, the activity on the test IrTl test 
substrate was still further improved. Li addition, the reduction, relative to the 4M variant, 
in cleavage of the X structure shown in Figure 22 constitutes an additional improvement 
in this enzyme's function. 

Not all amino acid changes that reduce DNA binding in the polymerization affect 
the 5' nuclease activity. For example, mutations E615A, R677A, affecting amino acid 
that are also in the thumb and fingers domains, respectively, have either adverse effect, or 
no effect on the 5' nuclease activities, respectively, as measured using the test substrates 
in Figures 21 and 22, and compared to the parent variants that lacked these changes. The 
R677A mutation was added to, and compared with the TaqSS variant, while the E615A 
mutation was added to and compared with the Taq 4M variant. The test methods 
described herein provide a convenient means of analyzing any variant for the alterations 
in the cleavage activity of both invasive an noninvasive substrates, for both DNA and 
RNA containing structures. Thus, the present invention provides methods for identifying 
all suitable improved enzymes. 

Alterations that might increase the affinity of the enzymes for the nucleic acid 
targets were also examined. Many of the mutations described above were selected 
because they caused the Klenow fragment enzyme to have decreased affinity for DNA, 
with the goal of creating enzymes more accepting of structures containing non-DNA 
strands. In general, the native DNA polymerases show a lower affinity for RNA/DNA 
duplexes, compared to their affinity for DNA/DNA duplexes. During the development of 
the present invention, it was sought to increase the general affinity of the proteins of the 
present invention for a nucleic acid substrate without restoring or increasing any 
preference for structures having DNA rather than RNA target strands. The substitution of 
amino acids having different charges was examined as a means of altering the interaction 
between the proteins and the nucleic acid substrates. For example, it was postulated that 
addition of positively charged amino acid residues, such as lysine (K), might increase the 
affinity of a protein for a negatively charged nucleic acid. 
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As noted above, alterations in the thumb region could affect the interactions of the 
protein with the nucleic acid substrate. In one example, the mutation G504K (tip of the 
thumb) was introduced in Taq4M and caused and enhancement of nuclease activity by 
15% on an RNA target. Additional positively charged mutations (A502K and E507K) 
5 further improve the RNA target dependent activity by 50% compared to the parent 
Taq4M enzyme. 

The use of data from pubUshed studies and molecular modehng, in combination 
with results accrued during the development of the present invention allowed the 
. . J identification of regions of the proteins in which changes of amino acids would be likely 
10 to cause observable differences in at least one aspect of cleavage function. While regions 
, , could be targeted in this way, it was observed that changes in different amino acids, even 
G3 if near or immediate neighbors in the protein, could have different effects. For example, 

while the A502K substitution created a marked increase in the RNA-dependent cleavage 
E activity of Taq 4M, changing amino acid 499 from G to a K, only 3 amino acids away 

Ml 

y| 15 from 502, gave minimal improvement. As can be seen in the Experimental Examples, 

J. the approach of the present invention was to change several amino acids in a candidate 

y region, either alone or in combination, then use the screening method provided in 

P| Example 1 to rapidly assess the effects of the changes. In this way, the rational design 

is 

K approach is easily applied to the task of protein engineering. 

20 In addition to the thumb, pahn, and hand regions found in the polymerase domain 

of these proteins, regions that are specific to 5' nucleases and nuclease domains were 
examined. Comparative studies on a variety of 5' nucleases have shown that, though the 
amino acid sequences vary dramatically from enzyme to enzyme, there are structural 
features common to most. Two of these features are the hehx-hairpin-heUx motif 

25 (H-h-H) and the arch or loop structure. The H-h-H motif is believed to mediate 

non-sequence specific DNA binding. It has been found in at least 14 families of proteins, 
including nucleases, N-glycosylases, ligases, helicases, topoisomerases, and polymerases 
(Doherty et al, Nucl. Acid. Res., 24:2488 [1996]). The crystallographic structure of rat 
DNA polymerase pol p bound to a DNA template-primer shows non-specific hydrogen 

30 bonds between the backbone nitrogens of the pol |3 HhH motif and the phosphate 
oxygens of the primer of the DNA duplex (Pelletier et aL, Science 264:1891 [1994]). 
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Because the HhH domain of 5' nuclease domains of Taq and Tth polymerases may 
function in a similar manner, it is contemplated that mutations in the HhH region of the 
enzyme alter activity. Mutations may be introduced to alter the shape and structure of the 
motif, or to change the charge of the motif to cause increased or decreased affinity for 
substrate. 

Another structure common to many 5' nucleases from diverse sources such as 
eukaryotes, eubacteria, archaea and phage, is the arch or loop domain. The crystal 
structure of the 5' exonuclease of bacteriophage T5 showed a distinct arch formed by two 
helices, one positively charged and one containing hydrophobic residues (Ceska et al^ 
Nature 382:90 [1996]). Interestingly, three residues that are conserved between T5 and 
Taq, Lys 83, Arg 86 and Tyr 82 are all in the arch. These correspond to amino acids Lys 
83, Arg 86, and Tyr 82 in Taq DNA polymerase. The crystal structure for Taq (5' 
nuclease) has also been determined (Kim et al. Nature 376:612 [1995]). 

The crystal structure from the flap endonuclease-1 from Methanococcus 
janneschii also shows such a loop motif (Hwang et ah, Nat. Struct. Biol., 5:707 [1998]). 
The backbone crystal structure of Mja FEN-1 molecules may be superimposed on T5 
exonuclease, Taq 5 '-exonuclease and T4 RNase H. An interesting feature common to all 
of these is the long loop. The loop of FEN-1 consists of a number of positively charged 
and aromatic residues and forms a hole with dimensions large enough to accommodate a 
single-stranded DNA molecule. The corresponding region in T5 exonuclease consists of 
three helices forming a helical arch. The size of the hole formed by the helical arch in T5 
exonuclease is less than half that formed by the LI loop in Mj FEN-1 . In T4 RNase H or 
Taq 5' exonuclease, this region is disordered. Some regions of the arch bind metals, 
while other regions of the arch contact nucleic acid substrate. Aligmnent of the 
amino-acid sequences of six 5' nuclease domains from DNA polymerases in the pol I 
family show six highly conserved sequence motifs containing ten conserved acidic 
residues (Kim et al. Nature 376 [1995]). 

The effects of alterations in the arch region were examined. In Taq polymerase 
the arch region is formed by amino acids 80-95 and 96-109. Site directed mutagenesis 
was performed on the arch region. Alignment of amino acid sequences of the FEN and 
polymerase 5' nucleases suggested the design of 3 amino acid substitution mutations. 
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P88E, P90E and G80E. These substitutions were made on the Taq4M polymerase mutant 
as a parent enzyme. Results indicated that although the background activity on the HP 
and X substrates shown in Figure 22 are tremendously suppressed in all mutants, the 
desirable 5' nuclease activity on proper substrates (IdT and L"T, Figure 24) is also 
reduced. Despite the sequence homology between Taq and Tth polymerases, they have 
very different activity on HP and X substrates. The alignment of the Taq and Tth 
polymerase arch regions also demonstrates regions of extensive sequence homology as 
well as minor differences. These differences led to the design of mutations L109F and 
Al lOT using Taq4M to generate Taq4M L109F/A1 lOT, and the mutant Taq 4M 
A502K/G504K/E507K/T514S to generate Taq 4M 

L109F/A1 10T/A502K/G504K/E507K/T514S mutant. These two mutations have 
drastically converted Taq4M enzyme to become more like Tth enzyme in terms of the 
background substrate specificity while the 5' nuclease activities on both DNA and RNA 
substrates are ahnost unchanged. 

4) Focused random mutagenesis 

As described above, physical studies and molecular modeling may be used alone 
or in combination to identify regions of the enzymes in which changes of amino acids are 
likely to cause observable differences in at least one aspect of cleavage function. Li the 
section above, use of this information was described to select and change specific amino 
acids or combinations of amino acids. Another method of generating an enzyme with 
altered function is to introduce mutations randomly. Such mutations can be introduced 
by a number of methods known in the art, including but not limited to, PGR amplification 
under conditions that favor nucleotide misincorporation (REF), amplification using 
primers having regions of degeneracy (i.e., base positions in which different individual, 
but otherwise similar oligonucleotides in a reaction may have different bases), and 
chemical synthesis. Many methods of random mutagenesis are known in the art (Del Rio 
a/., Biotechniques 17:1132 [1994]), and may be incorporated into the production of the 
enzymes of the present invention. The discussions of any particular means of 
mutagenesis contained herein are presented solely by way of example and not intended as 
a limitation. When random mutagenesis is performed such that only a particular region 
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of an entire protein is varied, it can be described as "focused random mutagenesis." As 
described in the Experimental Examples, a focused random mutagenesis approach was 
applied to vary the HhH and the thumb domains some of the enzyme variants previously 
created. These domains were chosen to provide examples of this approach, and it is not 
5 intended that the random mutagenesis approach be limited to any particular domain, or to 
a single domain. It may be apphed to any domain, or to any entire protein. Proteins thus 
modified were tested for cleavage activity in the screening reactions described in 
Example 1, using the test substrates diagrammed in Figures 22 and 24, with the results 
described in Tables 6 and 7. 
1 0 Random mutagenesis was performed on the HhH region with the parent TaqSS or 

TthDN H785A mutants. None of the 8 mutants generated showed an improvement in 
Q activity compared to the parent enzyme (Table 6). In fact, mutations of the region 
^ between residues 198-205 have about 2-5 fold lower activity on both DNA and RNA 
ri substrates, suggesting that this region is essential for substrate recognition. Mutagenesis 

1 5 in the thumb region resulted in new mutations that improved 5 ' nuclease activity by 
^ 20-100% on a DNA target and about 10% on an RNA target (Table 7). 

|i| Numerous amino acids in each of the distinct subdomains play roles in substrate 

%Z contact. Mutagenesis of these may alter substrate specificity by altering substrate 

Q binding. Moreover, mutations introduced in amino acids that do not directly contact the 

20 substrate may also alter substrate specificity through longer raage or general 

conformation altering effects. These mutations may be introduced by ^y of several 
methods known in the art, including, but not Umited to random mutagenesis, site directed 
mutagenesis, and generation of chimeric proteins. 

As noted above, numerous methods of random mutagenesis are known in the art. 
25 The methods appUed in the focused random mutagenesis described herein may be appUed 
to whole genes. It is also contemplated that additional usefiil chimerical constructs may 
be created through the use of molecular breeding (See e.g., U.S. Pat. No. 5,837,458 and 
PCT Publications WO 00/18906, WO 99/65927, WO 98/31837, and WO 98/27230, 
herein incorporated by reference in their entireties). Regardless of the mutagenesis 
30 method chosen, the rapid screening method described herein provides a fast and effective 
means of identif3dng beneficial changes within a large collection of recombinant 
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molecules. This makes the random mutagenesis procedure a manageable and practical 
tool for creating a large collection of altered 5' nucleases having beneficial 
improvements. The cloning and mutagenesis strategies employed for the enzymes used as 
examples are applicable to other thermostable and non-thermostable Type A 
polymerases, since DNA sequence similarity among these enzymes is very high. Those 
skilled in the art would understand that differences in sequence would necessitate 
differences in cloning strategies, for example, the use of different restriction 
endonucleases may be required to generate chimeras. Selection of existing alternative 
sites, or introduction via mutagenesis of altemative sites are well established processes 
and are known to one skilled in the art. 

Enzyme expression and purification can be accomplished by a variety of 
molecular biology methods. The examples described below teach one such method, 
though it is to be understood that the present invention is not to be hmited by the method 
of cloning, protein expression, or purification. The present invention contemplates that 
the nucleic acid construct be capable of expression in a suitable host. Numerous methods 
are available for attaching various promoters and 3' sequences to a gene structure to 
achieve efficient expression. 

5) Site-Specific mutagenesis 

In some embodiments of the present invention, any suitable technique (e.g., 
including, but not limited to, one or more of the techniques described above) are used to 
generate improved cleavage enzymes (e.g., SEQ ID NO:221) with heterologous domains. 
Accordingly, in some embodiments, site-specific mutagenesis (e.g., primer-directed 
mutagenesis using a commercially available kit such as the Transformer Site Directed 
mutagenesis kit (Clontech)) is used to make a plurality of changes thoughout a nucleic 
acid sequence in order to generate nucleic acid encoding a cleavage enzyme of the 
present invention. Insome embodiments, a plurality of primer-directed mutagenesis steps 
are carried out in tandem to produce a nucleic acid encoding a cleavage enzyme of the 
present invention. 

In some embodiments, a plurality of primer directed mutagenesis steps are 
directed to a selected portion of a nucleic acid sequence, to produce changes in a selected 
portion of a cleavage enzyme of the present invention. In other embodiments, a nucleic 
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acid having changes in one selected portion is recombined with a nucleic acid having 

mutations in a different selected portion through cloning, molecular breeding, or 

any of the other recombination methods known in the art), thereby creating a nucleic acid 

having mutations in a plurality of selected portions, and encoding a cleavage enzyme of 

5 the present invention. The mutations in each selected portion may be introduced by any 

of the methods described above, or any combination of said methods, including but not 

limited to methods of random mutagenesis and site-directed mutagenesis. 

For example, in one illustrative embodiment of the present invention, the nucleic 

acid sequence of SEQ ID NO:222 (a nucleic acid sequence encoding the cleavage 

10 enzyme of SEQ ID NO:221) is generated by making a plurality of primer-directed 

mutations to the nucleic acid sequence of SEQ ID NO: 104 (see Example 7 for the 

O construction of SEQ ID NO: 104). In some embodiments, each mutation is introduced 
P . 

using a separate mutagenesis reaction. Reactions are carried out sequentially such that 
:g the resuhing nucleic acid (SEQ ID NO: 222) contains all of the mutations. In another 
y 15 illustrative embodiment of the present invention, the nucleic acid sequence of SEQ ID 
]^ NO:222 is generated by making a plurality of primer-directed mutations, as described 

Q- above, in the nuclease portion (e.g., as diagrammed in Figure 6) of SEQ ID NO: 111. The 

fl 

m mutant nuclease portion is then combined with the "polymerase" portion of SEQ ID 

\m 

'pl NO: 1 04 at the Not I site, using the recombination methods described in Example 4, 
r| 20 thereby creating a single nucleic acid having SEQ ID NO:222, and encoding the cleavage 
enzyme of SEQ ID NO:221. Following mutagenesis, the resulting altered polypeptide is 
produced and tested for cleavage activity using any suitable assay (e.g., including, but not 
limited to, those described in Examples 1 and 6). In some embodiments, the nucleic acid 
sequence encoding the cleavage enzyme of SEQ ID NO:221 (e.g., SEQ ED NO:222) is 
25 further modified using any suitable method. 

V. Reaction Design for INVADER Assay Detection of RNA Targets; 

Approaches to designing INVADER assays for the detection of RNA targets can 
vary depending on the needs of a particular assay. For example, in some embodiments, 
30 an RNA to be detected or analyzed may be present in a test sample at low levels, so a 

high level of sensitivity (i.e., a low Umit of detection, or LOD) may be desirable; in other 
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embodiments, an RNA may abundant, and may not require an especially sensitive assay 
for detection. In some embodiments, an RNA to be detected may be similar to other 
RNAs in a sample that are not intended to be detected, so that a high level of selectivity 
in an assay is desirable, while in other embodiments, it may be desired that multiple 
similar RNAs be detected in a single reaction, so an assay may be provided that is not 
selective with respect to the differences among these similar RNAs. 

hi some embodiments it is especially desirable to avoid detection of any DNA 
molecules related to the target RNA molecules in a reaction. La some embodiments, this 
is accomplished by designing INVADER assay probe sets to RNA splice junctions, such 
that only the properly spliced mRNAs provide the selected target sites for detection. In 
other embodiments, samples are handled such that DNA remains double stranded (e.g., 
the nucleic acids are not heated or otherwise subjected to denaturing conditions), and is 
thus not available to serve as target in an INVADER assay reaction. In other 
embodiments, cells are lysed under conditions that leave nuclei intact, thereby containing 
and preventing detection of the genomic DNA, while releasing the cytosolic mRNAs into 
the lysate solution for detection by the assay. 

In some embodiments, the INVADER assay is to be used for detection or 
quantitation of an entire RNA having a particular variation of a sequence (e.g., a mutation 
a SNP, a particular spliced junction); in such embodiments, the location of the base or 
sequence to be detected is a determining factor in the selection of a site for the 
INVADER assay probe set to hybridize. In other embodiments, any portion of an RNA 
target may be used to indicate the presence or the amount of the entire RNA (e.g., as in 
gene expression analysis). In this case, the probe sets may be directed toward a portion 
of the RNA selected for optimal performance (e.g., sites determined to be particularly 
accessible for probe hybridization) as a target in the INVADER assay. 

The discussion of INVADER assay probe design is divided into the following sections: 

i. Target site selection based on accessibiUty 

ii. Target site selection based on selectivity 

iii. OUgonucleotide design 

a. Target-specific regions: length and melting temperature 
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b. Non-complementary regions 

c. Folding and dimer analysis 
iv. Assay performance evaluation 
V. Design and assay optimization 

5 

i. Target site selection based on accessibility 

One consideration in the selection of sites for detection is the availability of the 
target site for hybridization of the assay probe set. To simply use randomly selected 
complementary oligonucleotides for a given RNA target without prior knowledge of 
10 regions of the RNA that allow efficient hybridization can be an ineffective approach. For 
example, it is estimated that targeting RNA with antisense oligonucleotides based on 
random design results in one out of 18-20 tested oligonucleotides showing significant 
inhibition of gene expression (Sczakiel, Fronteirs in Biosciences 5:194 [2000]; Patzel et 
al. Nucleic Acids Res., 27:4328 [1999]; Peyman et al, Biol. Chem. Hoppe-Seyler 
15 367:195 [1995]; Monia et al. Nature Med., 2:668 [1996]). Secondary and tertiary 
structures of RNA are thought to be the major reasons that influence the ability of an 
oligonucleotide to bind targeted regions of the RNA (Vickers et al. Nucleic Acids Res., 
fl 28:1340 [2000]; Lima et al. Biochemistry 31:12055 [1992]; Uhlenbeck, J. Mol. Biol., 

65:25 [1972]; Freier and Tinoco, Biochemistry 14:3310 [1975]). This is due to the 
20 hybridization kinetics and thermodynamics of destroying any structural motifs of the 
RNA and, in return, hybridizing the complementary DNA oligonucleotide (Patzel et al. 
Nucleic Acids Res., 27:4328 [1999]; Mathews et al, RNA 5:1458 [1999]). Thus, the 
ability to identify regions of RNA that are "accessible" for hybridization is important for 
design and selection of effective oligonucleotides. 
25 There are several experimental and theoretical methods available for identifying 

accessible regions in RNA. These include the use of RNase-H footprinting (Ho et al, 
Nature Biotechnology 16:59 [1998]; Mateeva et al. Nucleic Acids Res., 25:5010 [1997]; 
Mateeva et aL, Nature Biotechnology 16:1374 [1998]), complementary arrays of 
oligonucleotide libraries (Southern et ai. Nucleic Acids Res., 22:1368 [1994]; Mir and 
30 Southern, Nature Biotechnology 17:788 [1999]), ribozyme libraries with random 

hexamer internal guide sequences (Campbell and Cech, RNA 1:598 [1995]; Lan et aL, 



n 
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Science 280:1593 [1998]), and RNA and DNA structure prediction computer programs 
(Sczakiel, Frontiers in Biosciences 5:194 [2000]; Patzel et aL, Nucleic Acids Res., 
27:4328 [1999]; Zuker, Science 244:48 [1989]; Walton et al, Biotechnol. Bioeng., 65:1 
[1999]). Recently, new methods have been developed that use primer extension to 
5 identify sites in RNAs that are accessible for hybridization. Target nucleic acids (e.g., 
mRNA target nucleic acids) are contacted with a plurahty of primers containing a 3' a 
region of degenerate sequence and primer extension reactions are conducted. Where the 
target nucleic acid is an RNA molecule, preferred enzymes for use in the extension 
reactions are reverse transcriptases, which produce a DNA copy of the RNA template. 
; 10 Folded structures present in the target nucleic acid affect the initiation and/or efficiency 
PI of the extension reaction. The extension products of the primers are analyzed to provide 

5!f a map of the accessible sites. For example, certain extension products are not generated 

'CM 

where the primer is complementary to a sequence that is involved in a folded structure. 

Q Regions of the target nucleic acid that do not allow hybridization of the primer and do not 
15 result in the production of an extension product are considered inaccessible sites. In 

Q contrast, the presence of an extension product indicates that the primer was able to bind 

to an accessible region of the target nucleic acid. Such methods are referred to herein as 

© "reverse transcription with random oligonucleotide libraries" or "RT-ROL" (HT AUawi, 

Fi et al, RNA 7(2):314-27 [2001]; PCT Publication WO 01/98537, incorporated herein by 

20 reference in its entirety). The use of a physical measurement such as RT-ROL or array 
hybridization provides the most direct evidence of the accessibility of a site on an RNA 
strand. In general, INVADER assays directed toward accessible regions produce 
stronger signals for a given amount of RNA than assays directed toward less accessible 
regions of an RNA strand. For the detection of rare RNAs (e.g., fewer than about 5,000 
25 to 10,000 copies per INVADER assay reaction), or in any assay wherein it is desirable to 
have the best {i.e., lowest) limit of detection possible, it may be beneficial to start the 
assay design by analyzing the RNA structure using RT-ROL or another method of 
physical analysis. 

In other embodiments, ease of assay design may be more important than creation 
30 of an assay with a particularly low LOD. Structure prediction software can simphfy the 
task of determining which parts of an RNA are likely to be single stranded, and thus be 
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more accessible for probe hybridization. As first step, the sequence of an RNA to be 
detected is entered into an electronic file. It may be entered manually or imported from a 
file a sequence data file, or a word processing file). In some embodiments, the 
sequence is downloaded from a database, such as GenBank or EMBL. The RNA 

5 sequence can then be analyzed using a program such as mfold (Zucker, Science 244:48 
[1989]), OligoWalk (Mathews et al, RNA 5:1458 [1999]), and variations of both 
(Sczakiel, Frontiers in Biosciences 5:194 [2000]; Patzel et al^ Nucleic Acids Res., 
27:4328 [1999]; Walton et al, Biotechnol. Bioeng., 65:1 [1999]). 

\ Mfold Analysis for target RNA structure prediction. 

, 10 The output of mfold analysis can be used in several ways to assist in identifying 

pais 

O accessible regions of an RNA target molecule. In one embodiment, the mfold program is 
S| used to generate an "ss count" file for identifying regions least likely to be involved in 



intra-strand baseparing. In another embodiment, the mfold program is used to generate a 
".ct" file, a file used as input information for use with RNA Structure 3.5 to perform an 
15 OhgoWaUc analysis. In preferred embodiments, for either use, the sequence to be 

detected is entered into mfold. In a preferred embodiment, the settings used in the mfold 




analysis include: 



• Folding Temperature: 37'C (Even though the INVADER reaction may not be 
conducted at this temperature.) 

• % Suboptimality: 5 

• # foldings: 50 
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Window Parameter: Default 
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Maximum distance between paired bases: No Limit 
Select BATCH folding 

Enter: an e mail address where the results are to be sent when ready 
Image Resolution: High 



Structure Format: Bases 



Base Number Frequency:Default 

Structure Rotation Angle: 0 
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Structure Annotation: SS-Count 
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• IM NaCl (Australian mfold Internet site only) 

When results are ready, an e mail message is sent containing the Web address of 
the results. The only file that is necessary for subsequent INVADER assay probe design 

5 analysis is the SS-Count file, which is then downloaded from the Results page. An 
exemplary mfold analysis using a GenBank entry for Human Ubiquitin (#4506712) is 
shown below: 

SS-Count analysis for accessible sites identification. 

The SS-Count file is then imported into an Excel spreadsheet file and the 
10 following options are chosen: Data Type = Delimited <press Next>; Delimited = Select 
j-i (x) Spaces; (x) Treat multiple delimiters as one <press Next>; Column for data format = 
General. Selecting these options results in the import into Excel of three columns of data 
i!i (Figure 39). The first line in the first column is the total number of stable structures 

J mfold foimd under the parameters used in the folding. With the Ubiquitin example, there 

15 were 12 structures foimd. 
f^ The rest of the first column is the RNA nucleotide position numbered fix)m 1 . 

The second column is the raw SS-Count number and represents the number of times the 
O corresponding base was NOT base-paired as part of some secondary structure. The third 

n 

column is the sequence of the RNA analyzed, identifying the base at each numbered 
20 position. By looking for bases that are involved in fewer structures {le., bases hsted in 

column 3 corresponding to the higher numbers in column 2), it is possible to identify 

regions of the mRNA (identified by position number in column 1) that are more likely to 

be firee of intra-strand base-pairing, and thus are more likely to be available for detection 

using the INVADER assay. 
25 One way of viewing the data is to calculate the rumiing average SS-count for a ten 

nucleotide stretch of the RNA (the Ave(lO) hidex) and chart the Ave(lO) Index against 

the base pair position (See e.g., Figure 48). 

An alternative plot is to graph the nucleotide position against the Ave(lO) Index 

expressed as a percentage of the total number of structures found by mfold (Fig 39). As 
30 with the raw SS-count table, regions of the RNA corresponding to higher numbers in the 
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Ave(lO) Index are involved in fewer predicted folded structures. Viewing the Ave(lO) 
Index in either a chart or graph format reduces the complexity of the data, and can reveal 
longer stretches of the RNA that are more likely to be structure-free. For example, from 
the graph of the running average, a user can pick out all of the major peak areas as likely 
regions for INVADER assay probe design. This creates an SS-Count Candidate List. In 
the Human ubiquitin example, there is one major peak and about 6 other peaks (Fig. 39) 
The next step is to refer back to the raw (i.e., not the running average) SS-Count 
data from within each of the peak areas, and identify the residue where the running 
average is changing in magnitude, this is a local "tum" and is generally a good candidate 
residue to be positioned at the INVADER assay probe set cleavage site. For example, in 
Human Ubiquitin, residue Gl 19 is foimd at a minor local tum within a globally 
accessible region (Fig. 40). The INVADER assay probe set with the cleavage site at this 
location is a good performer in detection of this RNA. Placing the cleavage site at Gl 14 
did not result in better detection even though it had a higher %Ave(10) value. While not 
limiting the present invention to any particular mechanism, and an understanding of the 
mechanism is not necessary to practice the invention, this is likely to be because this area 
was less accessible to the Probe or INVADER oligonucleotides than was position Gl 19. 

OligoWalk Structure prediction with RNA Structure 3.5 for accessible sites 
identification 

In some embodiments, the program OhgoWalk, a module of the software 
"RNAStructure'' (Mathews et al, RNA 5:1458 [1999]) is used in the selection of sites 
that are more likely to be accessible for oligonucleotide binding, OligoWalk uses sets of 
themiodynamic parameters for both RNA and DNA, and their hybrids (AUawi and 
SantaLucia, Biochemistry 36:10581 [1997]; Mathews et al, J. Mol. Biol., 288:91 1 
[1999]; Sugimoto et al^ Biochemistry 34:1 121 1 [1995]) in an algorithm that relies on 
mfold for RNA secondary structure prediction (Zucker, Science 244:48 [1989]). 
OligoWalk is designed to predict the most favorable regions of an RNA target for 
designing antisense oligonucleotides by estimating the overall thermodynamics of 
hybridizing an antisense oligomer to the RNA by taking into account the thermodynmnics 
of destroying any structural motifs in the RNA target or the antisense oligonucleotide. 
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The affinity of the ohgomer to its target is expressed as an overall Gibbs free energy 
change of a self-structured ohgomer, and of a target associating into an ohgomer-target 
complex. This free energy is usually a negative number, indicating favorable binding, 
and is expressed in ^kcal/mol* units. OligoWalk analysis is performed with 8 to 15 base 
5 oligonucleotide size to resemble the average length of the analyte specific region of the 
Signal Probe. Plotting the total binding energy against the length of the RNA generates a 
graph of peaks and valleys. The lowest negative values generally indicate the most 
favorable sites for ohgonucleotides to bind. The most inaccessible regions have positive 
binding energy values, and generally are a poor sites for assay probe design 
10 In a preferred embodiment, the OligoWalk module of RNA Structure 3.5 is used 

to determine binding energies by performing an 8-base OligoWalk using the following 
p settings: 

5{ ■ Break Local Structure 

+^ ■ Include suboptimal structures 

Q 15 ■ Oligo Length: 8nt 

■ Oligo Concentration: 100 nM 
Q ■ Oligo Type: DNA 

1=1 ■ Walk entire Target RNA 

PI 20 When these parameters have been set, the sequence file to be folded (the ".ct" 

output file firom mfold) can be selected and opened. Once the sequence has been folded, 
a report can be created using the Output menu. The report is imported into Excel and the 
data generated above is plotted. Li a preferred embodiment, the OligoWalk data is 
graphed with the SS-Count data. The regions displaying the lowest fi-ee energy values 

25 {ie.^ the largest negative numbers) are generally the most likely to be accessible for 
hybridization. In preferred embodiments, the 3' end and the majority of the target- 
binding region of the probe ohgonucleotide complement an accessible region of the target 
RNA. In particularly preferred embodiments, the majority of the binding site for the 
corresponding INVADER oligonucleotide falls within the same accessible region. In 

30 another preferred embodiment, the binding site for an INVADER ohgonucleotide falls 
within a nearby accessible region. 
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An INVADER oligonucleotide can generally be positioned to bind to a less 
accessible site. While not limiting the present invention to any particular mechanism, it 
is observed that the INVADER oligonucleotides are generally longer than probe 
oligonucleotides used in the INVADER assay reactions and, because they are generally 
designed to remain bound to the target at the reaction temperature, they will be selected 
to have a TmS about 12 to 15 °C higher than that of a corresponding probe. Consequently, 
INVADER oligonucleotides may more readily break the local target structure, and thus 
may be less dependent on the accessibility of the target-binding site. 

In selecting among accessible sites for the design of INVADER assay 
oligonucleotides, the base composition of the site is also considered. It has been 
observed that stretched of more than 4 or 5 of the same nucleotide in a row (e.g., 
. . . AAAA. , . or . . .CCCC. . .) in any portion of the binding site for the assay 
ohgonucleotides may reduce the performance of the probe set in the assay (e.g., by 
increasing background or decreasing specificity). Thus, in preferred embodiments, any 
stretches comprising four or more repeated bases are generally avoided. Another 
consideration is the effect of base composition on lengths of the oligonucleotides in the 
probe set. In many cases, targeting A-T rich sequences requires the use of longer 
oligonucleotides for a reaction performed at a given temperature, compared to the length 
of oligonucleotides targeted to sequences having a more even distribution of A-T and G- 
C bases. Longer oligonucleotides can be more prone to formation of intrastrand 
structures and dimer structures. Thus, it is preferred that the distribution or A-T bases 
and G-C bases within a t^get region be as close to even (z.e., about 50% G-C content) as 
the region to be detected permits. In particul^ly preferred embodiments, the distribution 
of A-T and G-C positions is evenly distributed across the binding sites (e.g., not having 
all A-T positions in one half, with all G-C positions in the other). 

ii. Target site selection based on selectivity 

In some embodiments, probe sets are designed to examine highly homologous, or 
closely related RNA targets (i.e., targets that are very similar in sequence). In such 
embodiments, the RNA or homologous cDNA sequences are compared, e.g., using an 
aUgnment program such as MEGALIGN (DNAstar Madison, WI). 
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In some embodiments, selectivity is provided by designing probe sets to detect 
splice junctions. Splice junctions can be identified by aligning the cDNA and gene 
sequences using an alignment program {e,g, MEGALIGN) or under the BLAST menu at 
the NCBI website (BLAST 2 sequences). Splice junctions are also often listed in the 
GenBank report (intron/exon sites). INVADER assay oligonucleotide sets are designed 
such that the probe and INVADER oligonucleotides are complementary to the coding 
strand (mRNA), generally with the cleavage site being as close to the splice junction as 
possible. In some embodiments, different spUce junctions within an mRNA are analyzed 
for accessibility, as described above. In preferred embodiments, probe sets are designed 
to detect one or more splice junctions showing greater accessibility compared to the 
accessibility of other splice junctions within the same RNA target. 

In some embodiments designed to exclude detection of RNAs related to the target 
RNA, sequences are examined to identify bases that are unique to the target RNA when 
compared to the other similar sequences fi:om which the target is distinguished. 
Generally, the unique base is positioned to hybridize to the 5' end of the t^get-specific 
region of the probe oligonucleotide. In some embodiments, two adjacent bases are 
unique to the target compared to the related RNA. If two adjacent unique bases are 
available in an appropriately accessible portion of the target RNA, it is preferred that 
these bases be used as the site around which the probe and INVADER ohgonucleotides 
sets are designed. In some embodiments, the two unique bases are positioned such that 
the site of cleavage of the probe is between the two base-pairs they form with the probe. 
In other embodiments, one of the unique bases is in the last position of the hybridization 
site of the INVADER oUgonucleotide (/.e., it is positioned to base-pair to the penultimate 
residue on the 3' end of the INVADER oligonucleotide). 

In some embodiments, the assay is designed to include detection of RNAs that are 
similar, but not identical, to the target RNA. If the assay is being designed for inclusive 
detection, the compared sequences are examined to identify sites having complete 
homology. Such designs can be created to detect homologous sequences in the same 
species or between species. Generally, the most homologous regions are selected as 
hybridization targets for probe oligonucleotides. Generally, some variation can be 
tolerated, for example, if it is not at the base that would hybridize to the 5* end of the 
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target-specific region of a probe. In some embodiments, variation is accommodated by 
the use of degenerate bases in the INVADER assay oligonucleotides (e.g., mixtures of 
bases are used at positions within thesynthesized probe, INVADER and/or stacker 
oligonucleotides, said mixtures selected to complement the mixture of specific bases 
5 present in the collection of related target RNAs). 

iii. Oligonucleotide design 

a. Target-specific regions: length and melting temperature 

As described above in Section I (a) concerning the ohgonucleotide design, in 
1 0 some embodiments, the length of the analyte-specific regions are defined by the 
□ temperature selected for running the reaction. Starting from the desired position (e.g., a 
y variant position or splice junction in a target RNA, or a site corresponding to a low free 
^ energy value in an OligoWalk analysis) an iterative procedure is used by which the length 
y of the ASR is increased by one base pair until a calculated optimal reaction temperature 
•^u 1 5 p^^s saj^ correction to compensate for enzyme and any other reaction conditions 

0 effects) matching the desired reaction temperature is reached. In general probes are 
m selected to have an ASR with a calculated Tm of about 60 °C if a stacking oligonucleotide 
J J is not used, and a Tm of about 50 to 55 "^C if a stacking ohgonucleotide is used (a stacking 
fl oligonucleotide typically raises the Tm of a flanking probe oligonucleotide by about 5 to 
20 15 ""C). If the position of variation or a splice junction is a starting position, then the 
additions are made to the 3' end of the probe. Alternatively, if the 3' end of the probe is 
to be positioned at the most accessible site, the additions are in the 5' direction. In some 
embodiments, wherein a stacker ohgonucleotide is to be used, it is preferred that the 
probe be designed to have a 3' base that has stable stacking interaction interface with the 
25 5' base of the stacker oligonucleotide. The stability of coaxial stacking is highly 

dependent on the identity of the stacking bases. Overall, the stability trend of coaxial 
stacking in decreasing order is purine:purine > purine :pyrimidine » pyrimidne:purine > 
pyrimidine:pyrimidine. In other embodiments employing a stacker, a less stable stacking 
interaction is preferred; in such cases the probe 3' base and/or the stacker 5' base are 
30 selected to provide a leass stable stacking interaction. In some embodiments, the probe 
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3' base and/or the stacker 5' base are selected to have a mismatch with respect to the 
target strand, to reduce the strength of the stacking interaction. 

The same principles are also followed for INVADER oligonucleotide design. 
Briefly, starting from the position N, additional residues complementary to the target 
5 RNA starting from residue N-1 are then added in the upstream direction until the stability 
of the INVADER-target hybrid exceeds that of the probe (and therefore the planned assay 
reaction temperature). In preferred embodiments, the stabiUty of the INVADER-target 
hj/brid exceeds that of the probe by 12-15 °C. In general, INVADER oligonucleotides 
are selected to have a Tm near 75 °C. Software applications, such as 
10 INVADERCREATOR (Third Wave Technologies, Madison, WI) or Ohgonucleotide 5.0 
may be used to assist in such calculations. 

y If a stacking oligonucleotide is to be used, similar design principles are applied. 

U 

Bi The stacking oligonucleotide is generally designed to hybridize at the site adjacent to the 

sss 

3' end of the probe oligonucleotide, such that the stacker/target helix formed can 
^1 1 5 coaxially stack with the probe/target helix. The sequence is selected to have a calculated 

n Ttn of about 60 to 65 °C, with the calculation based on the use of natural bases. 

Q 

m However, stacking oligonucleotides are generally synthesized using only 2'-0-methyl 

s "-sf • 

:^ nucleotides, and consequently, have actual TmS that are higher than calculated by about 

Q 0.8°C per base, for actual T^s close to 75 ^C. 

' 20 In some embodiments, ARRESTOR oUgonucleotides are included in a secondary 

reaction. ARRESTOR oligonucleotides are provided in a secondary reaction to sequester 
any remaining uncleaved probe from the primary reaction, to preclude interactions 
between the primary probe and the secondary target strand. ARRESTOR 
oligonucleotides are generally 2'-0-methylated, and comprise a portion that is 
25 complementary to essentially all of their respective probe's target-specific region, and a 
portion that is complementary to at least a portion of the probe's flap regions {e.g., six 
nucleotides, counted from the +1 base towards the 5' end of the arm). 
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b. Non-complementary regions 



Probe 5' Arm selection 

The non-complementary arm of the probe, if present, is preferably selected (by an 
iterative process as described above) to allow the secondary reaction to be performed at a 
5 particular reaction temperature. In the secondary reaction, the secondary probe is 
generally cycling, and the cleaved 5' arm (serving as an INVADER oligonucleotide) 
should stably bind to the secondary target strand. 

INVADER oligonucleotide 3' terminal mismatch selection 

y 10 In preferred embodiments, the 3' base of the INVADER ohgonucleotide is not 
0) complementary to the target strand, and is selected in the following order of preference 
(listed as INVADER oligonucleotide 3' base/target base): 
C in target: C/C > A/C > T/C > G/C 

A in target: A/A > C/A > G/A > T/A 

15 G in target: A/G > G/G > T/G > C/G 

U in target: C/U > A/U > T/U > G/U 



m 

Id 



5# 



c. Folding and dimer analysis 

In some embodiments, the oUgonucleotides proposed for use in the INVADER 
20 assay are examined for possible inter- and intra-molecular structure formation in the 
absence of the target RNA. In general, it is desirable for assay probes to have fewer 
predicted inter- or intra molecular interactions. In some embodiments, the program 
OLIGO (e.g., OLIGO 5.0, Molecular Biology Insights, Inc., Cascade, CO) is used for 
such analysis. In other embodiments, the program mfold is used for the analysis. In yet 
25 other embodiments, the RNAStructure program can be used for dimer analysis. The 
following sections provide stepwise instructions for the use of these programs for 
analysis of INVADER assay oligonucleotides. 
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OLIGO 5.0 analysis for Probe structure and interaction prediction. 

Analysis of INVADER oligonucleotides using OLIGO 5.0 comprises the 
following steps. All menu choices are shown in UPPER CASE type. 

5 1. Launch OLIGO 5.0 and open a sequence file for each mRNA to be analyzed. This 
is done by using a menu to select the following 

• Choose FILE- >NEW 

• Paste in longest available sequence 

• Choose ACCEPT & QUIT (F6) 

y 10 2. Set Program settings to default 

f Choose FILE- >RESET- >ORIGINAL DEFAULTS 

y 3. Identify Probe Oligonucleotide 

n • Select OLIGO LENGTH to be around 16 nucleotides (open the menu for this 

n 

option by using ctrl-L keystrokes) . 
2 15 • Move the cursor indicating the 5' end of the Current OUgo until the 3' end is 
Q located at the candidate cleavage site residue. 

^ • Choose ANALYSE- >DUPLEX FORMATION- >CURRENT OLIGO (ctrl-D) 
for a rough determination of the extent of dimer and hairpin formation. 

• Confirm length of analyte region corresponds with desired reaction temperature 
20 [e.g., through the use of Tm calculation as described in the Optimization of 

Reaction Conditions, I (c) of the Detailed Description of the Invention] 

• Select the "LOWER" button in OLIGO 5.0 to copy the anti-sense sequence (this 
will be the analyte-specific region of the actual probe oligonucleotide and is anti- 
sense to the RNA strand,) 

25 • Import into a database file. 

• Save to computer memory. 

4. Identify INVADER Oligonucleotide 

• Choose sequence adjacent to the probe oligonucleotide identified fi:om step 3. 
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• Select OLIGO LENGTH to -24 nucleotides 

• Confirm length of analyte region corresponds with desired reaction temperature 
[e.g., through the use of Tm calculation as described in the Optimization of 
Reaction Conditions, I (c) of the Detailed Description of the hivention, about 75 

for INVADER oligonucleotides). Select the "LOWER" button in OLIGO 5.0 
to copy the corresponding anti-sense sequence (this will be the analyte-specific 
region of the actual INVADER oligonucleotide.) 

• Import into a database file. 

• Save to computer memory. 

5. Addition of Cleaved Arm Sequence and INVADER Oligonucleotide Mismatch 
Sequence. 

• Export the Probe ohgonucleotide as Upper Primer. 

• Export the INVADER ohgonucleotide as Lower Primer, 

• EDIT UPPER PRIMER to add in a candidate arm sequence (selected, for 
example, as described above). 

• Check that the arm sequence does not create new secondary structures (analysis 
performed as described above). 

• EDIT LOWER PRIMER to add in the 3' mismatched nucleotide that will overlap 
into the cleavage site (selected according to the guidelines for this mismatched 
bases, provided above). 

• Select all Upper and Lower Primer boxes in the 'Trint/Save Options" 

• PRINT ANALYSIS of Upper (Probe) and Lower (INVADER) ohgonucleotides 
and check for lack of stable secondary structures. 

• Save both mRNA sequence and oligonucleotide sequence database files before 
quitting the program. 



Generally, ohgonucleotides having detected intra-molecular formations with 
stabilities of less than -6 AG are preferred. Less stable structures represent poor 
substrates for CLEAVASE enzymes, and thus cleavage of such structures is less likely to 
contribute to background signal. Probe and INVADER oligonucleotides having less 
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affinity for each other are more available to bind to the target, ensuring the best cycling 
rates. 

The Tm of dimerized probes (z.e., probes wherein one probe molecule is 
hybridized to another probe molecule) should ideally be lower than the Tm for the probe 
5 hybridized to the target, to ensure that the probes preferentially hybridize to the target 
sequence at the elevated temperatures at which INVADER assay reactions are generally 
conducted. Similarly, the Tm for the INVADER ohgonucleotide hybridized either to 
itself or to a probe molecule should be lower than the INVADER oUgonucleotide/target 
Tm. It is preferred that dimer TmS (i.e., Probe/Probe and Probe/INVADER 
10 oligonucleotide) be 25°C or less to ensure that they will be unlikely to form at the 
p. planned reaction temperature. 

O The melting temperatures for each of these complexes can be determined as 

a described above in Optimization of Reaction Conditions, I (c) of the Detailed Description 
of the Invention, or by using the OLIGO software. Once RNAs sites and several 
15 candidate INVADER assay ohgonucleotide sets are selected according to the process 

ill 

outlined above, the candidate ohgonucleotide sets can be ranked according to the degree 
to which they comply with preferred selection rules, e.g., their location on the SS-Count 
average plot (peak, valley, neither), and the energetic predictions of probe and 
INVADER oligonucleotide interactions. In some embodiments, the ranked probe sets are 
20 tested in order of rank to identify one or more sets having suitable performance in an 
RNA INVADER assay. In other embodiments, several of the top ranked sets (e.g., two, 
three or more) are selected for testing, to rapidly identify one or more sets having suitable 
or desireable performance. 

25 Mfold analysis for probe structure and interaction prediction 

Analysis of probe and INVADER ohgonucleotide interactions may be performed 
using mfold for DNA provided by Michael Zuker, available through Rensselaer 
Polytechnic Institute at bioinfo.math.rpi.edu/~mfold/dna/forml.cgi. The analysis is 
performed without changing the default ionic conditions, and with a selected temperature 
30 of 37 ""C and with % suboptimality set to 75. Each sequence (e.g., probe, INVADER 
ohgonucleotide, stacker, etc.) is folded using the program to check for any unimolecular 



157 



structure formation {e.g., hairpins). The energies provided by mfold gives for 
unimolecular structures can be used as provided, without further calculations. 

Bimolecular structure formation for a given oUgonucleotide is assessed by typing 
in the oUgonucleotide sequence (5* to 3') followed by the sequence of a small, stable 
haiipin forming sequence (e.g., CCCCCTTTTGGGGG [SEQ ID NO:707]), followed by 
the same oUgonucleotide sequence, again Usted 5' to 3. Constraints are entered to require 
that these Ts remain single-stranded and the strings of Cs and Gs in this spacer are 
basepaired. The command "F" is used to force basepairing, while the command "P" is 
used to prohibit basepairing, and the positions of the forced or prohibited basepairs are 
counted from the 5' end. For example, if the sequence of interest is a 20-mer, then the 
following is entered: 

F 21 0 5 [this forces the C's, C21 to C25, to base pair] 

P 26 0 4 [this forces the T's, T26 to T29, to be single stranded] 

F 30 0 5 [this forces the G's, G30 to G34, to base pair] 

On examination of the resulting structures, the stability of each can be estimated 
by subtracting the stability (i.e., the thermodynamic measures) of the central spacer 
hairpin from the total result (i.e., Thermodynamics of possible structure = mfold structure 
thermodynamics - core hairpin thermodynamics). For convenience, in some 
embodiments, any nearest neighbor interactions between the central hairpin and dimers 
formed by the test sequence are ignored for this calculation; a more accurate analysis 
would require consideration of this interaction. The core hairpin formed by 
CCCCCTTTTGGGGG (SEQ ID NO:707) has the following thermodynamics: AG = -5.3; 
AH = -37.8; AS = -104.8, 

The process can be demonstrated using the following probe sequence: 5'- 
CCCTATCTTTAAAGTTTTTAAAAAGTTTGA-3' (SEQ ID NO:708). The 
oUgonucleotide sequence is examined by mfold analysis for bimolecular structures using 
the following steps. 
1- In mfold sequence box type: 

CCCTATCTTTAAAGTTTTTAAAAAGTTTGACCCCCTTTTGGGGGCCC 
TATCTTTAAAGTTTTTAAAAAGTTTGA (SEQ ID NO: 1 37) 



158 



2- In the consteaint box type: 
P3604 
F31 05 
F4005 

5 

Results (showing one): 

structure 1 

dG = -14.2 dH = -150.5 dS = -439.5 Tm = 69.3 
CCCTATCTTT |G G T 

10 AAA TTTTTAAAAA TTTGA CCCCC T 

TTT AAAAATTTTT AAATT GGGGG T 
AG ^G G TCTATCCC T 

To evaluate the stability of the duplex: 

CCCTATCTTT | G G 

AAA TTTTTAAAAA TTTGA 
TTT AAAAATTTTT AAATT 
AG ^G G TCTATCCC 

the thermodyanamic values for the hairpin alone are subtracted from the values for the 
complete structure: 

AG = -14.2 -(-5.3) = -8.9, 
AH = - 150.5 - (-37.8) = -1 12.7, 
AS = -439,5 - (-104.8) = -334.7, 
Using a calculation wherein Tm CQ = {AH / [AS + R In (CT/4)]} - 273.15, wherein R is 
the gas constant 1.987 (cal/K.mol), In is the natural log, and CT is the total single strand 
concentration in Molar, this results in a calculated Tm of 46.1 for the non-hairpin 
portion of the structure. 

The above method is not limited to the use of the core hairpin sequence 
CCCCCTTTTGGGGG but rather any stable hairpin sequences can be used. For 
example, CGCGCGGAACGCGCG (SEQ ID NO: 138) or CCCGGGTTTTCCCGGG 
(SEQ ID NO: 139). However, if a different hairpin sequence is used, one needs to 
calculate its stability using mfold and use its thermodynamics in the subsequent 
calculations. 
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RNAStructure for oligonucleotide interaction prediction 

Dimer formation can also be evaluated using the RNAStructure program. Unlike 
mfold, RNAStructure allows the calculation of all possible oligonucleotide- 
oligonucleotide interactions and provides an output .ct file. One can then view the 
5 structures using any .ct viewing program such as RNAStructure or RNAvis (1997, P. 
Rijk, University of Antwerp (UIA), available on the Internet at rma.uia.ac.be/mavis) and 
evaluate the stability of any dimer formation using the nearest-neighbor model (Borer et 
al., 1974) and DNA nearest-neighbor parameters (AUawi & SantaLucia, 1997). 
For example, to evaluate the propensity of the sequence 5' 

■: 10 AGGCGCACCAATTTGGTGTT 3' (SEQ ID NO: 140) for dimer formation using the 
DNA Fold hitermolecular module of RNAStracture, the sequence is saved into a file 

© {^'S'^ probe.seq) and the following parameters are set: 

jSj Sequence file 1 : probe.seq 

:§ Sequence file 2: probe.seq 

Q 15 CT file: dimer.ct 

■"^ Max % Energy difference: 50 

it? 

O Max number of structures: 20 
m Window size: do not change 

fl 20 After the calculation is done, one can view the resulting .ct file using the "view" module 
of RNAStmcture. Generally, there will be several structures within the .ct file. The view 
module is used to view them individually. One of the dimers that the test sequece, above, 
can form according to RNAStmcture is: 

25 



AGGCG TT 
CACCAATTTGGTG 
30 GTGGTTTAACCAC 

TT GCGGA 



160 



According to the nearest-neighbor model (ie., using DNA nearest-neighbor and 
mismatch parameters [Allawi & SantaLucia, 1997]), the stability of this duplex in IM 
NaCl and at a probe concentration of 100 jiM is: 
5 AG^7 = -10.07 
AH=-87.6 
AS=-250.1 
Tm=50.1^C 

10 By changing the identities of Sequence Files 1 & 2, RNAStructure can be used to 
1-^ evaluate the possibility of any dimer formation between pairs of all of the DNA 
Q oligonucleotides present in an INVADER assay reaction. 

I3| iv. Assay performance evaluation 

5i 15 Probe sets selected according to the guidelines provided above can be tested in the 

INFVADER assay to evaluate performance. While the oligonucleotides are designed to 

fl perform at or near a particular desired reaction temperature, the best performance for a 

f9 

j4 given design may not be precisely at the intended temperature. Thus, in evaluating any 

O new INVADER assay probe set, it can be helpful to examine the performance in the 

fl 

20 INVADER assay conducted at several different reaction temperatures, over a range of 
about 10 to 15 centered around the designed temperature. For convenience, 
temperature optimization can be performed on a temperature gradient thermocycler with 
a fixed amount of RNA {e.g., 2.5 amoles of an in vitro transcript per reaction), and for a 
fixed amount of time (e.g., 1 hour each for Primary and Secondary reactions). The 

25 temperature gradient test will reveal the temperature at which the designed probe set 
produces the best performance (e.g., the highest level of target-specific signal compared 
to background signal, generally expressed as a multiple of the zero-target background 
signal, or "fold over zero"). 

The results can be examined to see how close the measured temperature optimum 

30 is to the intended temperature of operation. In some embodiments, it is desirable to have 
probe sets that operate at or near a pre-selected temperature. If the measured temperature 
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optimum is higher than the desired reaction temperature, a probe design can be altered in 
ways that tend to reduce the probe/target Tm (e.g. , shortened by one or more bases, or 
altered to contain one or more mismatched bases). In some embodiments, wherein a 
stacker oligonucleotide is not used, wherein the reaction temperature is more than 7 
above the desired reaction temperature, and wherein the performance (e.g., the fold over 
zero) is acceptable, use of a 3' mismatch on the probe ohgonucleotide is likely to lower 
the reaction temperature without otherwise altering the assay performance. 

An LOD determination can be made by performing reactions on varying amounts 
of target RNA (e.g., an in vitro transcript control RNA of known concentration), hi 
preferred embodiments, a designed assay has an LOD of less than 0.05 attomole. In 
particularly preferred embodiments, a designed assay has an LOD of less than 0.01 
attomole. It is contemplated that the same guideline provided above for reducing the 
LOD of a designed assay may be used for the purpose of raising the LOD of a designed 
assay, i.e., to make it LESS sensitive to the presence of a target RNA. For example, it 
may be desirable to detect an abundant RNA md a rare RNA in the same reaction. In 
such a reaction, it may be desirable to attenuate the signal generated for the abundant 
RNA so that it does not overwhebn the signal from the rarer species. In some 
embodiments this may be done by designing probe sets for reduced signal generation, 
e.g., an LOD of at least (not less than) 0.5 attomoles. In some embodiments, a single step 
INVADER assay may be used for detection of abundant targets in a sample, while 
sequential INVADER reactions to amplify signal, as described in Section II, may be used 
for less abundant analytes in the same sample. In preferred embodiments, the single step 
and the sequential INVADER assay reactions for the different analytes are performed in a 
single reaction. 

In some embodiments, time course reactions are run, wherein the accumulation of 
signal for a known amount of target is measured for reactions run for different lengths of 
time. This measurement will establish the linear ranges, i.e., the ranges in which accurate 
quantitative measurements can be made using a given assay design, with respect to time 
and starting target RNA level. 
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y. Design and assay optimization 

Some designed assays may not meet the preferred performance criteria described 
above. A number of variations on the performance of INVADER assay reactions have 
been described herein. In optimizing performance of the INVADER assay for the 

5 detection of RNA targets, these variations may be used alone or in combination. For 
example, in some embodiments, a stacker oligonucleotide is employed. While not 
limiting the present invention to any particular mechanism of action, in some 
embodiments, a stacker oligonucleotide may enhance performmce of an assay by altering 
the hybridization characteristics (eg., Trtd of a probe or an INVADER ohgonucleotide. 

10 In some embodiments, a stacker oligonucleotide may increase performance by enabUng 
the use of a shorter probe. In other embodiments, a stacker ohgonucleotide may enhance 
performance by altering the folded structure of the target nucleic acid. In yet other 
embodiments, the enhancing activity of the stacker ohgonucleotide may involve these 
and other mechanisms in combination. 

15 In other embodiments, the target site may be shifted. In some embodiments, 

reactions are optimized by testing multiple probe sets that shift along a suspected 
accessible site. In preferred embodiments, such probe sets shift along the accessible site 
in one to two base increments. In embodiments wherein accessible sites have previously 
been predicted only by computer analysis, physical detection of the accessible sites may 

20 be employed to optimize a probe set design. In preferred embodiments, the RT-ROL 

method of detecting accessible sites is employed. In some embodiments, optimization of 
a probe set design may require shifting of the target site to a newly identified accessible 
site. 

In some embodiments, e.g., wherein an accessible site has been identified yet 
25 probe set performance is low, a change in the design of a probe 5' arm may improve 

assay performance without altering the site targeted. In other embodiments, altering the 
length of an ARRESTOR ohgonucleotide {e.g., increasing the length of the portion that is 
complementary to the 5' arm region of the probe) may reduce background signal, thus 
increasing the probe stet performance. 
30 Other variations on ohgonucleotide design may be employed to alter performance 

in an assay. Some modifications may be employed to shift the ideal operating 
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temperature of a probe set design into a preferred temperature range. For example, the 
use of shorter oligonucleotides and the incorporation of mismatches generally act to 
reduce the TmS, and thus reduce the ideal operating temperatures, of designed 
oligonucleotides. Conversely, the use of longer oligonucleotides and the employment of 
5 stacking oligonucleotides generally act to increase the TmS, and thus increase the ideal 
operating temperatures of the designed oligonucleotides. 

Other modifications may be employed to alter other aspects of oUgonucleotide 
performance in an assay. For example, the use of base analogs or modified bases can 
alter enzyme recognition of the oUgonucleotide. In some embodiments, such modified 
1 0 bases are used to protect a region of an oligonucleotide fi'om nuclease cleavage. In other 
embodiments, modified bases are used to affect the ability of an oligonucleotide to 
participate as a member of a cleavage structure that is not in a position to be cleaved (e.g., 
to serve as an INVADER oligonucleotide to enable cleavage of a probe). These modified 
bases maybe referred to as "blocker" or "blocking" modifications. In some 
W 15 embodiments, assay oligonucleotides incorporate 2'-0-methyl modifications. In other 
embodiments, assay oUgonucleotides incorporate 3' terminal modifications (e.g., NHi, 3' 
hexanol, 3' phosphate, 3' biotin). 
f In yet other embodiments, the components of the reaction may be altered to affect 

g assay performance. For example, oligonucleotide concentrations may be varied. 

20 OUgonucleotide concentrations can affect multiple aspects of the reaction. Since melting 
temperatures of complexes are partly a fimction of the concentrations of the components 
of the complex, variation of the concentrations of the oUgonucleotide components can be 
used as one facet of reaction optimization. In the methods of the present invention, 
ARRESTOR oUgonucleotides may be used to modulate the availability of the primary 
25 probe oligonucleotides in an INVADER assay reaction. In some embodiments, an 

ARRESTOR oligonucleotide may be excluded. Other reaction components may also be 
varied, including enzyme concentration, salt and divalent ion concentration and identity. 
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VI. Kits for performing the RNA INVADER Assay 
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In some embodiments, the present invention provides kits comprising one or more 
of the components necessary for practicing the present invention. For example, the 
present invention provides kits for storing or deUvering the enzymes of the present 
invention and/or the reaction components necessary to practice a cleavage assay (e.g., the 
INVADER assay). The kit may include any and all components necessary or desired for 
the enzymes or assays including, but not limited to, the reagents themselves, buffers, 
control reagents (e.g., tissue samples, positive and negative control target 
oligonucleotides, etc.), solid supports, labels, written and/or pictorial instructions and 
product information, inhibitors, labeling and/or detection reagents, package 
environmental controls (e.g., ice, desiccants, etc.), and the like. In some embodiments, 
the kits provide a sub-set of the required components, wherein it is expected that the user 
will supply the remaining components. In some embodiments, the kits comprise two or 
more separate containers wherein each container houses a subset of the components to be 
dehvered. For example, a first container (e.g., box) may contain an enzyme (e.g., 
structure specific cleavage enzyme in a suitable storage buffer and container), while a 
second box may contain oligonucleotides (e.g., INVADER ohgonucleotides, probe 
oUgonucleotides, control target oligonucleotides, etc.). In some embodiments one or 
more the reaction components may be provided in a predispensed format (i.e., 
premeasured for use in a step of the procedure without re-measurement or re-dispensing). 
In some embodiments, selected reaction components are mixed and predispensed 
together. In preferred embodiments, predispensed reaction components are predispensed 
and are provided in a reaction vessel (including but not limited to a reaction tube or a 
well, as in, e.g., a microtiter plate). In particularly preferred embodiments, predispensed 
reaction components are dried down (e.g., desiccated or lyophihzed) in a reaction vessel. 

Additionally, in some embodiments, the present invention provides methods of 
delivering kits or reagents to customers for use in the methods of the present invention. 
The methods of the present invention are not hmited to a particular group of customers. 
Indeed, the methods of the present invention find use in the providing of kits or reagents 
to customers in many sectors of the biological and medical community, including, but not 
hmited to customers in academic research labs, customers in the biotechnology and 
medical industries, and customers in governmental labs. The methods of the present 
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invention provide for all aspects of providing the kits or reagents to the customers, 
including, but not limited to, marketing, sales, delivery, and technical support. 

In some embodiments of the present invention, quality control (QC) and/or 
quality assurance (QA) experiments are conducted prior to delivery of the kits or reagents 
5 to customers. Such QC and QA techniques typically involve testing the reagents in 
experiments similar to the intended commercial uses (e.g., using assays similar to those 
described herein). Testing may include experiments to determine shelf life of products 
and their ability to withstand a wide range of solution and/or reaction conditions (e.g. , 
temperature, pH, Hght, etc.). 
10 In some embodiments of the present invention, the compositions and/or methods 

of the present invention are disclosed and/or demonstrated to customers prior to sale (e.g., 
through printed or web-based advertising, demonstrations, etc.) indicating the use or 

Q 

fj functionaUty of the present invention or components of the present invention. However, 
^-J in some embodiments, customers are not informed of the presence or use of one or more 
15 components in the product being sold. In such embodiments, sales are developed, for 
example, through the improved and/or desired function of the product (e.g. , kit) rather 
than through knowledge of why or how it works (i e., the user need not know the 

ft 

components of kits or reaction mixtures). Thus, the present invention contemplates 

in 

y| making kits, reagents, or assays available to users, whether or not the user has knowledge 
Cl 20 ofthe components orworidngs of the system. 

Accordingly, in some embodiments, sales and marketing efforts present 
information about the novel and/or improved properties ofthe methods and compositions 
of the present invention. In other embodiments, such mechanistic information is withheld 
from marketing materials. In some embodiments, customers are surveyed to obtain 
25 information about the type of assay components or delivery systems that most suits their 
needs. Such information is useful in the design ofthe components ofthe kit and the 
design of marketing efforts. 

VII. The INVADER Assay for Direct Detection and Measurement of Specific 
30 RNA Analytes. 
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The following section provides a few illustrative examples of mRNAs that may be 
detected or measured using the methods, compositions and systems of the present 
invention. Application of the methods and compositions of the present invention is not 
limited to these target nucleic acid. 

HOUSEKEEPING CONTROLS 

RNAs that are generally present in predicable or invariant amounts in test samples 
provide useful control targets for detection assays. These controls can be useful in 
several ways, including but not limited to providing confranation of the proper function 
of an assay, and as a standard against which a test result for another RNA can be 
compared or measured to aid in interpretation of a result. mRNAs for the following 
genes find particular use in the methods of the present invention. 
Human Ubiquitin and Mouse/Rat Ubiquitin 

The ubiquitin system is a major pathway for selective protein degradation. 
Degradation by this system is instrumental in a variety of cellular fimctions such as DNA 
repair, cell cycle progression, signal transduction, transcription, and antigen presentation. 
The ubiquitin pathway also eliminates proteins that are misfolded, misplaced, or that are 
in other ways abnormal. This pathway requires the covalent attachment of ubiquitin 
(El), a highly conserved 76 amino acid protein, to defined lysine residues of substrate 
proteins. 

Human, rat and mouse gIyceraldehyde-3-phosphate dehydrogenase (GAPDH) 

GAPDH is an important enzyme in the glycolysis and gluconeogenesis pathways. 
This homotetrameric enzyme catalyzes the oxidative phosphorylation of D- 
glyceraldehyde-3 -phosphate to 1,3-diphosphoglycerate in the presence of cofactor and 
inorganic phosphate. A variety of diverse biological properties of GAPDH have been 
reported. These include functions in endocytosis, niRNA regulation, tRNA export, DNA 
replication, DNA repair, and neuronal apoptosis. 
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CYTOKINES 

A growing fimiily of regulatory proteins that deliver signals between cells of the 
immune system has been identified. Called cytokines, these proteins have been found to 
control the growth and development, and bioactivities, of cells of the hematopoietic and 
immune systems. C3^okines exhibit a wide range of biological activities with target cells 
from bone marrow, peripheral blood, fetal liver, and other lymphoid or hematopoietic 
organs. The present invention describes methods for the detection of expression of 
cytokines, including but not limited to of the exemplary members of the cytokine family 
listed below. 

Human Oncostatin M 

Oncostatin M is a secreted single-chain polypeptide cytokine that regulates the 
growth of certain tumor-derived and normal cell lines. A number of cell types have been 
found to bind the oncostatin M protein. While it has been shown to inhibit proUferation 
of a number of tumor cell types, it has also been implicated in stimulating proliferation of 
Kaposi*s sarcoma cells. 

Human transforming growth factor-beta (TGF-p) 

Transforming growth factor-beta (TGF-beta) is a member of a family of 
structurally-related cytokines that elicit a variety of responses, including growth, 
differentiation, and morphogenesis, in many different cell types. In vertebrates, at least 
five different forms of TGF-beta, termed TGF-betal to TGF-beta5, have been identified; 
they all share a high degree (60%-80%) of amino-acid sequence identity. While TGF- 
betal was initially characterized by its ability to induce anchorage-independent growth of 
normal rat kidney cells, its effects on most cell types are anti-mitogenic. It is strongly 
growth-inhibitory for many types of cells, including both normal and transformed 
epithelial, endothehal, fibroblast, neuronal, lymphoid, and hematopoietic cells. In 
addition, TGF-beta plays a central role in regulating the formation of extracellular matrix 
^d cell-matrix adhesion processes. 
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Human monocyte chemoattractant protein-1 (MCP-1) 

Within this family of cytokines, an emerging group of chemotactic cytokines, also 
called chemokines or intercrines, has been identified. Two subfamilies of chemokines 
have been recognized, alpha and beta, based on chromosomal location and the 
arrangement of the cysteine residues. 

The human genes encoding the beta subfamily proteins are located on 
chromosome 17 (their mouse counterparts are clustered on mouse chromosome 11, which 
is the counterpart of human chromosome 17). Homology in the beta subfamily ranges 
from 28-45% intraspecies, from 25-55% interspecies. An exemplary member is the 
human protein MCP-1 (monocyte chemoattractant protein-1). MCP-1 exerts several 
effects specifically on monocytes. It is a potent chemoattractant for human monocytes in 
vitro and can stimulate an increase in cytosolic free calcium and the respiratory burst in 
monocytes. MCP-1 has been reported to activate monocyte-mediated tumoristatic 
activity, as well as to induce tumoricidal activity. MCP-1 has been implicated as an 
important factor in mediating monocytic infiltration of tissues inflammatory processes 
such as rheumatoid arthritis and alveolitis. The factor may also play a fundamental role 
in the recruitment of monocyte-macrophages into developing atherosclerotic lesions. 

Human tumor necrosis factor alpha (TNF-a) 

Tumor necrosis factor alpha (TNF-alpha also cachectin) is an important cytokine 
that plays a role in host defense. The cytokine is produced primarily in macrophages and 
monocytes in response to infection, invasion, injury, or inflammation. Some examples of 
inducers of TNF-alpha include bacterial endotoxins, bacteria, viruses, lipopolysaccharide 
(LPS) and cytokines including GM-CSF, IL-1, IL-2 and IFN-gamma. 

Despite the protective effects of the cytokine, overexpression of TNF-alpha often 
results in disease states, particularly in infectious, inflammatory and autoimmune 
diseases. This process may involve the apoptotic pathways. High levels of plasma TNF- 
alpha have been found in infectious diseases such as sepsis syndrome, bacterial 
meningitis, cerebral malaria, and AIDS; autoimmune diseases such as rheumatoid 
arthritis, inflammatory bowel disease (including Crohn's disease), sarcoidosis, multiple 
sclerosis, Kawasaki syndrome, graft-versus-host disease and transplant (allograft) 
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rejection; and organ failure conditions such as adult respiratory distress syndrome, 
congestive heart failure, acute liver failure and myocardial infection. Other diseases in 
which TNF-alpha is involved include asthma, brain injury following ischemia, non- 
insulin-dependent diabetes mellitus, insulin-dependent diabetes mellitus, hepatitis, atopic 
dermatitis, and pancreatitis. Further, inhibitors of TNF-alpha have been suggested to be 
useful for cancer prevention. Elevated TNF-alpha expression may also play a role in 
obesity. TNF-alpha was found to be expressed in hxmian adipocytes and increased 
expression, in general, correlated with obesity. 

Human mterleukin-6 (IL-6) 

IL-6 is the standardized name of a cytokine called B lymphocyte differentiating 
factor, interferon beta2, 26 Kd protein, hybridoma/plasmacytoma growth factor, 
hepatocyte stimulating factor, etc. 

IL-6 induces activated B cells to be differentiated into antibody forming cells. 
For T cells, IL-6 induces T cells stimulated by mitogens to produce IL-2 and induces the 
expression of IL-2 receptor on a certain T cell line or thymocytes. For blood forming 
cells, IL-6 induces the growth of blood forming stem cells synergistically in the presence 
of IL-3. Furthermore, recently, it was reported that IL-6 acted like thrombopoietin. 

IL-6 is produced by various cells. It is produced by lymphocytes and is also 
produced by human fibroblasts stimulated by Poly (I)-Poly (C) and cycloheximide. 
Murine IL-6 is produced in mouse cells, which are stimulated by Poly (A)-Poly (U). 
Inducers for stimulation are diverse, and include known cytokines such as IL-1, TNF and 
IFN-beta, growth factors such as PDGF and TGF-beta, LPS, PMA, PHA, cholera toxin, 
etc. Moreover, it is reported that human vascular endothelial cells, macrophages, human 
glioblastomas, etc. also produce IL-6. Furthermore, it is also known that the productivity 
can be further enhanced by stimulating cells using an inducer and subsequently treating 
the cells by a metabolic inhibitor such as verapamil, cycloheximide or actinomycin D, 
etc. 
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Human interleukin Ibeta (IL-ip) 

Interleukin-1 (IL-1) is important to the activation of T and B lymphocytes and 
mediates many inflammatory processes. Two distinct forms of IL-1 have been isolated 
and expressed; termed IL-lbeta and IL-lalpha. IL-lbeta is the predominant form 
5 produced by human monocytes both at the mRNA and protein level. The two forms of 
human IL-1 share only 26% amino acid homology. Despite their distinct polypeptide 
sequences, the two forms of IL-1 have structural similarities, in that the amino acid 
homology is confined to discrete regions of the IL-1 molecule. The two forms of IL-1 
also possess identical biological properties, including induction of fever, slow wave 
10 sleep, and neutrophilia, T- and B-lymphocyte activation, fibroblast proliferation, 
J . cytotoxicity for certain cells, induction of collagenases, synthesis of hepatic acute phase 

Q proteins, and increased production of colony stimulating factors and collagen. IL-1 also 
jjl activates endothelial cells, resulting in increased leukocyte adhesiveness, PGI2 and PGE2 
J (prostaglandins) release, and synthesis of platelet activating factor, procoagulant activity, 
IM 1 5 and a plasminogen activator inhibitor. Clearly, IL-1 plays a central role in local and 

systemic host responses. Because many of the biological effects of IL- 1 are produced at 
^ picomolar (pg) concentrations in vivo, IL-1 production is likely a fundamental 
f% characteristic of host defense mechanisms. 

n 20 Human interleukin 2 (IL-2) 

Interleukin-2 (IL-2) is the main growth factor of T lymphocytes. By regulating T 
helper lymphocyte activity IL-2 increases the humoral and cellular immune responses. 
By stimulating cytotoxic CDS T cells and NK cells, this cytokine participates in the 
defense mechanisms against tumors and viral infections. IL-2 is used in therapy against 

25 metastatic melanoma and renal adenocarcinoma. IL-2 is used in clinical trials in many 
forms of cancer. It is also used in HIV infected patients and leads to a significant 
increase in CD4 counts. Human IL-2 is a protein of 133 amino acids (aa) composed of 
four alpha helices connected by loops of various length, its tridimensional structure has 
been established. IL-2R is composed of three chains alpha, beta and gamma. IL2Ralpha 

30 controls the affinity of the receptor IL-2Rbeta and IL-2Rgamma are responsible for IL-2 
signal transduction. The different molecular areas of IL-2 interacting with the three 
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chains of the IL-2 R have been defined. More specifically it has been determined that a 
helix A as well as the NH2 terminal area of IL-2 (residues 1 to 30) control the interactions 
IL-2/IL-2Rbeta. 



5 Human interleukin 8 (IL-8) 

Human IL-8 is a cytokine that has variously been called neutrophil-activating 
protein, neutrophil chemotactic factor (NCF) and T-cell chemotactic factor. IL-8 can be 
secreted by several types of cells upon appropriate stimulation. IL-8 is secreted by 
activated monocytes and macrophages as well as by embryonic fibroblasts. 
, ; 10 IL-8 is known to induce neutrophil migration aad to activate fimctions of 

neutrophils such as degranulation, release of superoxide anion and adhesion to the 
H}' endotheUal cell monolayer. There are a number of conditions that are known to involve 
p leukocyte infiltration into lesions. These include pulmonary diseases such as pulmonary 
% cystic fibrosis, idiopathic pulmonary fibrosis, adult respiratory distress syndrome, 
Ci 15 sarcoidosis and empyema; dermal diseases such as psoriasis, rheumatoid arthritis; and 
^ inflammatory bowel disease (Crohn's Disease). 

The amino acid sequence characterizing human IL-8 was described by 
h| Matsushima, et al. in PCT application WO89/08665. More recently, it was shown that 
Ijl monocyte-derived IL-8 was evidently variably processed at the N-terminus and that the 
W 20 IL-8 originally disclosed by Matsushima et al, was accompanied by two forms of the 
factor which had seven or five additional amino acids at the N-terminus (Yoshimura, et 
al, Mol Immunol 26:87 [1989]). The longest form accounted for about 8%, the next 
longest form for about 47%, and the shortest form for about 45% of the total IL-8 derived 
firom monocytes. 

25 

Human interleukin 10 (IL-10) 

Interleukin- 10 (IL-10), a recently discovered lymphokine, was originally 
described as an inhibitor of interferon-gamma synthesis and is postulated as a major 
mediator of the hxmxoral class of immune response. Two classes of often mutually 
30 exclusive immune responses are the humoral (antibody-mediated) and the delayed-type 
hypersensitivity. 
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It is postulated that these two differing immune responses may arise from two 
types of helper T-cell clones, namely Thl and Th2 helper T-cells, which demonstrate 
distinct cytokine secretion pattems. Mouse Thl cell clones secrete interferon-gamma, 
and IL-2 and preferentially induce the delayed-type hypersensitivity response while Th-2 
5 cell clones secrete IL-4, IL-5 and IL-10 and provide support for the humoral responses. 
The contrast in immune response could result because interferon-gamma secreted by the 
Thl cell clones inhibits Th2 clone proUferation in vitro, while IL-10 secreted by the Th2 
cell clones inhibits cytokine secretion by the Thl cell clones. Thus the two T-helper cell 
types may be mutually inhibitory and may provide the underpinning for the two 
10 dissimilar immune responses. 

IL-10 has been cloned and sequenced from both murine and human T cells. Both 
Mi sequences contain an open reading frame encoding a polypeptide of 178 amino acids with 
S an N-terminal hydrophobic leader sequence of 1 8 amino acids, and have an amino acid 
B sequence homology of 73%. 

i 15 

"% Human interleukin 4 (IL-4) 

Interleukin-4 (IL-4, also known as B cell stimulating factor, or BSF-1) was 
j^l ongmally characterized by its ability to stimulate the proliferation of B cells in response 

to low concentrations of antibodies directed to surface immunoglobulin. More recently, 
Q 20 IL-4 has been shown to possess a far broader spectrum of biological activities, including 
growth co-stimulation of T cells, mast cells, granulocytes, megakaryocytes, and 
erythrocytes. In addition, IL-4 stimulates the proliferation of several IL-2- and IL-3- 
dependent cell Unes, induces the expression of class n major histocompatibihty complex 
molecules on resting B cells, and enhances the secretion of IgE and IgGl isotypes by 
25 stimulated B cells. Both murine and human IL-4 have been definitively characterized by 
recombinant DNA technology and by purification to homogeneity of the natural murine 
protein. 

The biological activities of IL-4 are mediated by specific cell surface receptors for 
IL-4 that are expressed on primary cells and in vitro cell lines of mammaUan origin. IL-4 
30 binds to the receptor, which then transduces a biological signal to various immune 
effector cells. 
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Human interferon gamma (IFN-y) 

Like the interleukins, interferons belong to the class of the cytokines and are listed 
in various classes: interferon-alpha, interferon-beta, interferon-gancuna, interferon-omega 
and interferon-tau. Interferon-gamma is a glycoprotein, the amino acid sequence of 
which has been known since 1982. In the mature condition the interferon-gamma has 
143 amino acids and a molecular weight of 63 to 73 kilodaltons. 

The tertiary and quaternary structure of the non-glycosylised protein was clarified 
in 1991. According to this, interferon-gamma exists as a homodimer, the monomers 
being orientated in contrary directions in such a way that the C-terminal end of one 
monomer is located in the vicinity of the N-terminal end of the other monomer. Each of 
these monomers in all has six alpha-helices. Interferon-gamma is also called 
immunointerferon, as it has non-specific antiviral, antiproliferative and in particular 
immunomodulatory effects. Its production in T-helper-lymphocytes is stimulated by 
mitogens and antigens. The effect of the expressed interferon-gamma has not yet been 
precisely clarified, but is subject to intensive research. In particular, interferon-gamma 
leads to the activation of macrophages and to the synthesis of histocompatability antigens 
of the class 2. In vitro, the activity of interferon-gamma is normally determined as a 
reduction in the virus-induced cytopathic effect, which arises from treatment with 
interferon-ganama. Due to its antigen-non-specific antiviral, antiproliferative and 
immunomodulatory activity it is suitable as a human therapeutic agent, for example of 
kidney tumours and chronic granulomatosis. 

CYTOCHROME P450s 

The term cytochrome P-450 refers to a family of enzymes (located on the 
endoplasmic reticulum, with high concentrations of the proteins in the cells of the liver 
and small intestine) that are primarily responsible for the metabolism of xenobiotics such 
as drugs, carcinogens and environmental chemicals, as well as several classes of 
endobiotics such as steroids and prostaglandins. Members of the cytochrome P450 
family are present in varying levels and their expression and activities are controlled by 
variables such as chemical environment, sex, developmental stage, nutrition and age. 
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More than 200 cytochrome P450 genes have been identified. There are multiple 
forms of these P450 genes and each of the individual forms exhibit degrees of specificity 
towards individual chemicals in the above classes of compounds. In some cases, a 
substrate, whether it be drug or carcinogen, is metabolized by more then one of the 
cytochromes P450. Genetic polymorphisms of cytochromes P450 result in 
phenotypically-distinct subpopulations that differ in their abiUty to perform 
biotransformations of particular drugs and other chemical compounds. 

The present invention provides methods for the detection cytochrome P450 
mRNAs, including but not limited to, Human CYP lAl, Human CYP 1A2, Human CYP 
2B1, Human CYP 2B2, Human CYP 2B6, Human CYP 2C19, Human CYP 2C9, Human 
CYP 2D6, Human CYP 3A4, Human CYP 3A5, Human CYP 3A7, Rat CYP 2E1, Rat 
CYP 3A1, Rat CYP 3A2, Rat CYP 4A1, Rat CYP 4A2 and Rat CYP 4A3. 
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EXAMPLES 



The following examples serve to illustrate certain preferred embodiments and 
aspects of the present invention and are not to be construed as limiting the scope thereof. 

hi the disclosure which follows, the foUov^ng abbreviations apply: Afu 
(Archaeoglobus fulgidus); Mth (Methanobacterium thermoautotrophicum); Mja 
(Methanococcus jannaschii); Pfu (Pyrococcus furiosus); Pwo (Pyrococcus woesei); Taq 
{Thermus aquaticus); Taq DNAP, DlSiAPTaq, and Taq Pol I (T aquaticus DNA 
polymerase I); DNAPStf (the Stoffel fragment of DNAPJa^); DNAPEcl {E. coli DNA 
polymerase I); Tth {Thermus thermophilus); Ex. (Example); Fig. (Figure);°C (degrees 
Centigrade); g (gravitational field); hr (hour); min (minute); olio (oUgonucleotide); rxn 
(reaction); vol (volume); w/v (weight to volume); v/v (volume to volume); BSA (bovine 
serum albumin); CTAB (cetyltrimethylammonium bromide); HPLC (high pressure Uquid 
chromatography); DNA (deoxyribonucleic acid); p (plasmid); ]A (microliters); ml 
(milliliters); \xg (micrograms); mg (milUgrams); M (molar); mM (milliMolar); |iM 
(microMolar); pmoles (picomoles); amoles (attomoles); zmoles (zeptomoles); 
nm (nanometers); kdal (kilodaltons); OD (optical density); EDTA (ethylene diamine 
tetra-acetic acid); FITC (fluorescein isothiocyanate); SDS (sodium dodecyl sulfate); 
NaP04 (sodium phosphate); NP-40 (Nonidet P-40); Tris (tris(hydroxymethyl)- 
aminomethane); PMSF (phenylmethylsulfonylfluoride); TBE (Tris-Borate-EDTA, z.e., 
Tris buffer titrated with boric acid rather than HCl and containing EDTA); 
PBS (phosphate buffered saline); PPBS (phosphate buffered saline containing 1 mM 
PMSF); PAGE (polyacrylamide gel electrophoresis); Tween (polyoxyethylene-sorbitan); 
ATCC (American Type Culture Collection, Rockville, MD); Coriell (Coriell Cell 
Repositories, Camden, NJ); DSMZ (Deutsche Sammlung von Mikroorganismen und 
Zellculturen, Braunschweig, Germany); Ambion (Anibion, Inc., Austin, TX); Boehringer 
(Boehringer Mannheim Biochemical, IndiaaapoUs, IN); MJ Research (MJ Research, 
Watertown, MA; Sigma (Sigma Chemical Company, St, Louis, MO); Dynal (Dynal A.S., 
Oslo, Norway); Gull (Gull Laboratories, Salt Lake City, UT); Epicentre (Epicentre 
Technologies, Madison, WI); Lampire (Biological Labs., Inc., Coopersberg, PA); MJ 



176 



Research (MJ Research, Watertown,MA); National Biosciences (National Biosciences, 
Plymouth, MN); NEB (New England Biolabs, Beverly, MA); Novagen (Novagen, Inc., 
Madison, WI); ; Promega (Promega, Corp., Madison, WT); Stratagene (Stratagene 
Cloning Systems, La JoUa, CA); Clonetech (Clonetech, Palo Alto, CA) Pharmacia 
(Pharmacia, Piscataway, NJ); Milton Roy (Milton Roy, Rochester, NY); Amersham 
(Amersham Litemational, Chicago, IL); and USB (U.S. Biochemical, Cleveland, OH). 
Glen Research (Glen Research, Steriing, VA); Coriell (Coriell Cell Repositories, 
Camden, NJ); Gentra (Gentra, Minneapohs, MN); Third Wave Technologies (Third 
Wave Technologies, Madison, WI); PerSeptive Biosystems (PerSeptive Biosystems, 
Framington, MA); Microsoft (Microsoft, Redmond, WA); Qiagen (Qiagen, Valencia, 
CA); Molecular Probes (Molecular Probes, Eugene, OR); VWR (VWR Scientific, ); 
Advanced Biotechnologies (Advanced Biotechnologies, INC., Columbia, MD); and 
Perkin Ehner (also known as PE Biosytems and AppUed Biosystems, Foster City, CA). 

EXAMPLE 1 
Rapid screening of colonies for 5^ nuclease activity 

The native 5' nucleases and the enzymes of the present invention can be tested 
directly for a variety of functions. These include, but are not limited to, 5' nuclease 
activity on RNA or DNA targets and background specificity using altemative substrates 
representing structures that may be present in a target detection reaction. Examples of 
nucleic acid molecules having suitable test structures are shown schematically in Figures 
18A-D and Figures 21-24. The screening techniques described below were developed to 
quickly and efficiently characterize 5' nucleases and to determine whether the new 5' 
nucleases have any improved or desired activities. Enzymes that show improved cycling 
rates on RNA or DNA targets, or that resuh in reduced t^get-independent cleavage merit 
more thorough investigation. Li general, the modified proteins developed by random 
mutagenesis were tested by rapid colony screen on the substrates shown in Figures 18A 
and 18B. A rapid protein extraction was then done, and a test of activity on altemative 
structures, (e,g., as shown in Figures 18C-D) was performed using the protein extract. 
Either the initial screen, or fiirther screening and characterization of enzymes for 
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improved activity may be performed using other cleavage complexes, such as those 
diagrammed in Figures 21-24. It is not intended that the scope of the invention be limited 
by the particular sequences used to form such test cleavage structures. One skilled in the 
art would understand how to design and create comparable nucleic acids to form 
5 analogous structures for rapid screening. 

This order of testing may be chosen to reduce the number of tests overall, to save 
time and reagents. The order of testing for enzyme function is not intended to be a 
limitation on the present invention. Those mutants that showed reasonable cycling rates 
with the RNA or DNA targets may then be cultured overnight, and a rapid protein 
10 extraction done. Altematively, any subset or all of the cleavage tests may be done at the 
same time. 

For convenience, each type of rapid screen may be done on a separate microti ter 
plate. For example, one plate may be set up to test RNA DEADER activity, one plate 
set up to test for DNA INVADER activity. As many as 90 different colonies may be 
1 5 screened on one plate. The colonies screened can be from a variety of sources, such as 
'f:J clones of unaltered (native) 5' nucleases, from one mutagenesis reaction {e.g., many 

5? colonies from a single plate) or from a variety of reactions (colonies selected from 

Q 

111 multiple plates) . 

Ideally, positive and negative controls should be run on the same plate as the 
Q 20 mutants, using the same preparation of reagents. One example of a good positive control 
is a colony containing the uimiodified enzyme, or a previously modified enzyme whose 
activity is to be compared to new mutants. For example, if a mutagenesis reaction is 
performed on the Taq DN RX HT construct (described below), the unmodified Taq DN 
RX HT construct would be chosen as the standard for comparing the effects of 
25 mutagenesis on enzymatic activity. Additional control enzymes may also be incorporated 
into the rapid screening test. For example, Tth DN RX HT (described below; unless 
otherwise specified, the TaqPol and TthPol enzymes of the following discussion refer to 
the DN RX HT derivative) may also be included as a standard for enzymatic activity 
along with the Taq DN RX HT. This would allow a comparison of any altered enzymes 
30 to two known enzymes having different activities. A negative control should also be run 
to determine the background reaction levels (i.e., cleavage or probe degradation due to 
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sources other than the nucleases being compared). A good negative control colony would 
be one containing only the vector used in the cloning and mutagenesis, for example, 
colonies containing only the pTrc99A vector. 

Two factors that may influence the number of colonies chosen from a specific 
5 mutagenesis reaction for the initial rapid screen are 1) total number of colonies obtained 
from the mutagenesis reaction, and 2) whether the mutagenesis reaction was site-specific 
or randomly distributed across a whole gene or a region of a gene. For example, if only 
5-10 colonies are present on the plate, all colonies can easily be tested. If hundreds of 
colonies are present, a subset of these may be analyzed. Generally 10-20 colonies are 
10 tested from a site-specific mutagenesis reaction, while 80 to 100 or more colonies are 
routinely tested fi-om a single random mutagenesis reaction. 
Il Where indicated, the altered 5' nucleases described in these experimental 

W examples were tested as detailed below. 

;fj 15 A. Rapid screen: INVADER activity on RN A target (Figure 18 A) 

;^ A 2X substrate mix was prepared, comprising 20 mM MOPS, pH 7.5, 10 mM 

i| MgS04, 200 mM KCl, 2 |liM FRET-probe ohgo SEQ ID NO:223 

(5'-Fl-CGCT-cy3-TCTCGCTCGC-3'), 1 juM INVADER oligo SEQ ID NO:224 
(5'-ACGGAACGAGCGTCTTTG-3'), and 4 nM RNA target SEQ ID NO:225 (5'-GCG 
20 AGC GAGA CAG CGA AAG ACG CUC GUU CCG U-3'). Five ^il of the 2X substrate 
mix were dispensed into each sample well of a 96 well microtiter plate (Low Profile 
MULTIPLATE 96, M.J. Research, Inc.). 

Cell suspensions were prepared by picking single colonies (mutants, positive 
control, and negative control colonies) and suspending each one in 20|al of water. This 
25 can be done conveniently in a 96 well microtiter plate format, using one well per colony. 

Five |il of the cell suspension was added to the appropriate test well such that the 
final reaction conditions were 10 mM MOPS, pH 7.5, 5 mM MgS04, 100 mM KCl, 1 ^iM 
FRET-probe oUgo, 0.5 )iM INVADER ohgo, and 2 nM RNA target. The wells were 
covered with 10 jal of Clear CHILLOUT 14 (M.J. Research, Inc.) liquid wax, and the 
30 samples were heated at 85°C for 3 minutes, then incubated at 59^C for 1 hour. After the 
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incubation^ the plates were read on a Cytofluor flourescense plate reader using the 
following parmieters: excitation 485/20, emission 530/30. 



B. Rapid screen: INVADER activity on DNA target (Figure 18B) 

5 A 2X substrate mix was prepared, comprising 20 mM MOPS, pH 7.5, 1 0 mM 

MgS04, 200 mM KCl, 2 ^iM FRET-probe oUgo SEQ ID NO:223 
(5'-Fl-CGCT-Cy3-TCTCGCTCGC-3'), 1 mM INVADER oligo SEQ ID NO:224 
(5'-ACGGAACGAGCGTCTTTG-3'X 1 nM DNA target SEQ ID NO:226 (5'-GCG 
i AGC GAGA CAG CGA AAG ACG CTC GTT CCG T-3 Five ^il of the 2X substrate 
10 mix was dispensed into each sample well of a 96 well microtiter plate (MJ Low Profile). 
tJL Cell suspensions were prepared by picking single colonies (mutants, positive 

Sf control and negative control colonies) and suspending them in 20 |li1 of water, generally 

CI 

HI in a 96 well microtiter plate format. 

121 5 |Lil of the cell suspension were added to the appropriate test well such that the 

15 final reaction conditions were 10 mM MOPS, pH 7.5, 5 mM MgS04, 100 mM KCl, 1 ^iM 

:^ FRET-probe ohgo, 0.5 |iM INVADER ohgo, and 0.5 nM DNA target. Wells were 

n covered with lOja 1 of Clear CHILLOUT 14 (MJ. Research, Inc.) liquid wax, and the 

m reactions were heated at 85*^C for 3 minutes, then incubated at 59°C for 1 hour. After the 

W hour incubation, the plate were read on a Cytofluor flourescan plate reader using the 

ri 

20 following parameters: excitation 485/20, emission 530/30, gain 40, reads per well 10. 

C. Rapid protein extraction (crude cell lysate) 

Those mutants that gave a positive or an unexpected result in either the RNA or 
DNA INVADER assay were further analyzed, specifically for background activity on the 

25 X-structure or the hairpin substrate (Figure 18C and D, respectively). A rapid colony 
screen format can be employed, as described above. By simply changing the substrate, 
tests for background or aberrant enzymatic activity can be done. Another approach 
would be to do a rapid protein extraction from a small ovemight culture of positive 
clones, and then test this crude cell lysate for additional protein function. One possible 

30 rapid protein extraction procedure is detailed below. Two to five ml of LB (containing 
the appropriate antibiotic for plasmid selection; See e.g., Maniatis, books 1,2 and 3) were 
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inoculated with the remaining volume of the 20 \il water-cell suspension and incubated at 
ST^'C overnight. About 1.4 ml of the culture were transferred to a 1.5 ml microcentrifuge 
tube, and microcentrifuged at top speed (e.g,, 14,000 rpm in an Eppendorf 5417 table top 
microcentrifuge), at room temperature for 3-5 minutes to pellet the cells. The supematant 
5 was removed, and the cell pellet was suspended in 100 i^l of TES buffer pH 7.5 (Sigma). 
Lysozyme (Promega) was added to a final concentration of 0.5|Lig/ial and samples were 
incubated at room temperature for 30 minutes. Samples were then heated at 70^C for 10 
minutes to inactivate the lysozyme, and the cell debris was pelleted by 
microcentrifugation at top speed for 5 minutes. The supematant was removed and this 
10 crude cell lysate was used in the following enzymatic activity assays. 

2 D. Rapid screen: background specificity X structure substrate (Figure 18C) 

r| Reactions were performed imder conditions as detailed above. One pi of crude 

ill cell lysate was added to 9 |il of reaction components for a final volume of 1 0 |^1 and final 

15 concentrations of 10 mM MOPS, pH 7.5, 5 mM MgS04, 100 mM KCl, 1 ^iM 
1 FRET-probe oligo (SEQ ID NO:223), 0.5|aM X-structure INVADER oligo SEQ ID 
R| NO:227 (5 '-ACGGAACGAGCGTCTTTCATCTGTCAATC-3 '), and 0.5 nM DNA 
SI target (SEQ ID NO:226). Wells were covered with 10 ^il of Clear CHILLOUT 14 (MJ. 
0 Research, Inc.) liquid wax, and the reactions were heated at 85°C for 3 minutes, then 

20 incubated at 59''C for 1 hour. After the incubation, the plates were read on a Cytofluor 
fluorescence plate reader using the following parameters: excitation 485/20, emission 
530/30, gain 40, reads per well 10. 

E. Rapid screen: background specificity hairpin substrate (Figure 18D) 

25 Reactions were performed under conditions as detailed above. One lal of crude 

cell lysate was added to 9 |al of reaction components for a final volume of 10 |j,l and final 
concentrations of 10 mM MOPS, pH 7.5, 5 mM MgS04, 100 mM KCl, 1 ^M 
FRET-probe oligonucleotide (SEQ ID NO:223), and 0.5 nM DNA target (SEQ ID 
NO:226), Wells were covered with 10 |il of Clear CHILLOUT 14 (MJ. Research, hic.) 

30 hquid wax, and the reactions were heated at 85°C for 3 minutes, then incubated at 59°C 
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for 1 hour. After the hour incubation, the plate were read on a Cytofluor plate reader 
using the following parameters: excitation 485/20, emission 530/30, gain 40, reads per 
well 10. 



5 F. Activity assays with IrTl and IdT targets (Figures 24) 

The 5' nuclease activities assays were carried out in 10 \il of a reaction containing 
10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 ng/ml tRNA, 100 mM 
KCl and 5 mM MgS04. The probe concentration (SEQ ID NO: 167) was 2 mM. The 
substrates (IrTl (SEQ ID NO: 228) or IdT (SEQ ID NO: 229) at 10 or 1 nM final 
1 0 concentration respectively) and approximately 20 ng of an enzyme, prepared as in 
Example 3, were mixed with the above reaction buffer and overlaid with CHILLOUT 
(MJ Research) liquid wax. Reactions were brought up to reaction temperature 57 °C, 
started by addition of MgS04, and incubated for 10 min. Reactions were then stopped by 
5 the addition of 1 0 ^il of 95% formamide containing 10 mM EDTA and 0.02% methyl 
S 1 5 violet (Sigma). Samples were heated to 90°C for 1 minute immediately before 
1 electrophoresis through a 20% denaturing acrylamide gel (19:1 cross-linked), with 7 M 
urea, and in a buffer of 45 mM Tris-borate, pH 8.3, 1.4 mM EDTA. Unless otherwise 
indicated, 1 \i\ of each stopped reaction was loaded per lane. Gels were then scamied on 
an FMBIO-100 fluorescent gel scanner (Hitachi) using a 505 nm filter. The fraction of 
20 cleaved product was determined from intensities of bands corresponding to uncut and cut 
subsfrate with FMBIO Analysis software (version 6.0, Hitachi). The fraction of cleavage 
product did not exceed 20% to ensure that measurements approximated initial cleavage 
rates. The turnover rate was defined as the number of cleaved signal probes generated 
per target molecule per minute under these reaction conditions (1/min). 

25 

G. Activity assays with X structure (X) and hairpin (HP) targets (Figures 22) 

The 5' nuclease activity assays were carried out in 10 p,l of a reaction containing 
10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 ng/ml tRNA, 100 mM 
KCl and 5 mM MgS04. Each oligo for formation of either the hairpin structure assembly 
30 (22A, SEQ ID NOS: 230 and 23 1) assembly or the X structure assembly (22B, SEQ ID 
NOS: 230-232) was added to a fmal concentration of 1 |jm, and approximately 20 ng of 
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test enzyme prepared as described in Example 3, were mixed with the above reaction 
buffer and overlaid with CHILLOUT (MJ Research) Uquid wax. Reactions were brought 
up to reaction temperature 60 °C, started by addition of MgS04, and incubated for 10 
min. Reactions were then stopped by the addition of 10 |al of 95% formamide containing 
10 mM EDTA and 0.02% methyl violet (Sigma). Samples were heated to WC for 1 
minute immediately before electrophoresis through a 20% denaturing acrylamide gel 
(19:1 cross-linked), with 7 M urea, and in a buffer of 45 mM Tris-borate, pH 8.3, 1.4 mM 
EDTA. Unless otherwise indicated, 1 nl of each stopped reaction was loaded per lane. 
Gels were then scanned on an FMBIO-100 fluorescent gel scanner (Hitachi) using a 505 
nm filter. The fraction of cleaved product was determined from intensities of bands 
corresponding to uncut and cut substrate with FMBIO Analysis software (version 6.0, 
Hitachi). The fraction of cleavage product did not exceed 20% to ensure that 
measurements approximated initial cleavage rates. The turnover rate was defined as the 
number of cleaved signal probes generated per target molecule per minute under these 
reaction conditions (1/min). 

H. Activity assays with human IL-6 target (Figure 10) 

The 5' nuclease activities assays were carried out in 10 ^il reactions containing 10 
mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 ng/ml tRNA, 100 mM 
KCl and 5 mM MgS04. Reactions comprising the DNA IL-6 substrate contained 0.05 
nM IL-6 DNA target (SEQ ID NO: 163) and 1 ^iM of each probe (SEQ ID NO: 162) and 
INVADER (SEQ ID NO: 161) oligonucleotides, and were carried out at 60°C for 30 min. 
Reactions comprising the IL-6 RNA target (SEQ ID NO: 160) were performed under the 
same conditions, except that the IL-6 RNA target concentration was 1 nM and the 
reactions were performed at 57°C for 60 mdn. Each reaction contained approximately 20 
ng of test enzyme, prepared as described in Example 3. 

I. Activity assays with synthetic r25mer target (Figure 23) 

Reactions comprising the synthetic r25mer target (SEQ ID NO: 233) were carried 
out under the same reaction conditions (10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% 
Nonidet P-40, 10 ^ig/ml tRNA, 100 mM KCl and 5 mM MgS04) and 1 ^M of each probe 
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(SEQ ID NO: 234) and INVADER (SEQ ID NO: 235) oligonucleotides, except that the 
r25mer target concentration was 5 nM and the reactions were performed at 58°C for 60 
min. Approximately20ngof each test enzyme was added to the reactions. Enzymes 
were prepared as described in Example 3. 

5 

Any of the tests described above can be modified to derive the optimal conditions 
for enzymatic activity. For example, enzyme titrations can be done to determine the 
optimal enzyme concentration for maximum cleavage activity, and lowest background 
^ J signal. By way of example, but not by way of limitation, many of the mutant enzymes 

10 were tested at 10, 20 and 40 ng amounts. Similarly, a temperature titration can also be 
5 incorporated into the tests. Since modifying the structure of a protein can alter its 

y temperature requirements, a range of temperatures can be tested to identify the condition 
r|| best suited for the mutant in question. Additionally, different buffer conditions may be 
5J tested. Since modifying the structure of a protein can alter its requirements for salts (e.g. , 
W 1 5 KCl), a range of salt (or other buffer component) concentrations can be tested to identify 
55 ' the conditions best suited for the mutant in question. In some embodiments, enzymes are 
y tested or used in conditions that do not comprise salt added as a buffer component 
ri (although some salt may be present from sample preparation or enzyme preparation), 
pi Examples of the results from such screens (using approximately 20 ng of the 

20 mutant enzyme) are shown in Tables 3-8, and Figures 12, 14, 15, 19, and 25. 



EXAMPLE 2 

Cloning and Expression of 5' nucleases of DNA polymerases and mutant 

polymerases 

25 

A. DNA polymerases of Thermus aquaticus and Thermus thermopMus 
1. Cloning of TaqPol and TthPol 

Type A DNA polymerases from eubacteria of the genus Thermus share extensive 
30 protein sequence identity (90% in the polymerization domain, using the Lipman-Pearson 
method in the DNA analysis software from DNAStar, WI) and behave similarly in both 
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polymerization and nuclease assays. Therefore, the genes for the DNA polymerase of 
Thermus aquaticus (TaqPol), Thermus thermophilus (TthPol) and Thermus scotoductus 
were used as representatives of this class. Polymerase genes from other eubacterial 
organisms, including, but not limited to, Escherichia coli. Streptococcus pneumoniae, 

5 Mycobacterium smegmatis, Thermus thermophilus, Thermus sp., Thermotoga maritima, 
Thermosipho africanus, and Bacillus stearothermophilus are equally smtable. 

a. Initial TaqPol Isolation: mutant TaqA/G 

The Taq DNA polymerase gene was amplified by polymerase chain reaction from 
10 genomic DNA from Thermus aquaticus, strain YT-1 (Lawyer et al, supra), using as 
primers the oligonucleotides described in SEQ ID NOS:236 and 237. The resulting 
b fragment of DNA has a recognition sequence for the restriction endonuclease £'coRl at 
S the 5' end of the coding sequence and a Bglll sequence at the 3 ' end of the coding strand. 
=P Cleavage with BgHl leaves a 5' overhang or "sticky end" that is compatible with the end 
5 1 5 generated by Bamm. The PCR-ampHfied DNA was digested with EcdSl and BamBl. 
* The 25 12 bp fragment containing the coding region for the polymerase gene was gel 
0 purified and then ligated into a plasmid that contains an inducible promoter. 
m In one embodiment of the invention, the pTTQl 8 vector, which contains the 

JJ hybrid trp4ac (tac) promoter, was used (M J.R. Stark, Gene 5:255 [1987]). The tac 
11 20 promoter is under the control of the E, coli lac repressor protein. Repression allows the 
synthesis of the gene product to be suppressed until the desired level of bacterial growth 
has been achieved, at which point repression is removed by addition of a specific inducer, 
isopropyl-b-D-thiogalactopyranoside (IPTG). Such a system allows the controlled 
expression of foreign proteins that may slow or prevent growth of transformants. 
25 Particularly strong bacterial promoters, such as the synthetic ?tac, may not be 

adequately suppressed when present on a multiple copy plasmid. If a highly toxic protein 
is placed under control of such a promoter, the small amount of expression leaking 
through, even in the absence of an inducer, can be harmful to the bacteria. In another 
embodiment of the invention, another option for repressing synthesis of a cloned gene 
30 product is contemplated. A non-bacterial promoter from bacteriophage T7, found in Ihe 
plasmid vector series pET-3, was used to express the cloned mutant Taq polymerase 
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genes (Studier and Moffatt, J. Mol. Biol., 189: 1 13 [1986]). This promoter initiates 
transcription only by T7 RNA polymerase. In a suitable strain, such as 
BL21(DE3)pLYS, the gene for the phage T7 RNA polymerase is carried on the bacterial 
genome under control of the lac operator. This arrangement has the advantage that 
5 expression of the multiple copy gene (on the plasmid) is completely dependent on the 
expression of T7 RNA polymerase, which is easily suppressed because it is present in a 
single copy. 

These are just two examples of vectors having suitable inducible promoters. 
Others are well known to those skilled in the art, and it is not intended that the improved 
1 0 nucleases of the present invention be limited by the choice of expression system. 
y. For ligation into the pTTQl 8 vector, the PGR product DNA containing the Tag 

y polymerase coding region (termed mutTa^ for reasons discussed below, SEQ ID 
W NO:238) was digested with EcoRI and Bglll and this fragment was ligated under 
m standard "sticky end" conditions (Sambrook et al. Molecular Cloning, Cold Spring 
W 1 5 Harbor Laboratory Press, Cold Spring Harbor, pp. 1 .63-1 .69 [1 989]) into the EcoSI and 
r BaniRl sites of the plasmid vector pTTQlS. Expression of this construct yields a 
S translational fusion product in which the first two residues of the native protein 
3 (Met- Arg) are replaced by three from the vector (Met- Asn-Ser), but the remainder of the 
O PCR product's protein sequence is not changed (SEQ ID NO:239). The construct was 
' ^ 20 transformed into the JM109 strain ofE. coli, and the transformants were plated under 
incompletely repressing conditions that do not permit growth of bacteria expressing the 
native protein. These plating conditions allow the isolation of gaies containing 
pre-existing mutations, such as those that result from the infideUty of Taq polymerase 
during the amplification process. 
25 Using this amplification/selection protocol, a clone was isolated containing a 

mutated Taq polymerase gene (mutra^). The mutant was first detected by its phenotype, 
in which temperature-stable 5' nuclease activity in a crude cell extract was normal, but 
polymerization activity was almost absent (approximately less than 1% of wild type Taq 
polymerase activity). Polymerase activity was determined by primer extension reactions. 
30 The reactions were carried out in 1 0 |al of buffer containing 1 0 mM MOPS, pH 7.5, 5 
mM MgS04, 100 mM KCl. In each reaction, 40 ng of enzyme were used to extend 10 
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|LiM (dT)25-3o primer in the preesnce of either 10 \xM poly (A)286 or 1 |uM poly (dA)273 
template, 45 |liM dTTP and 5 ^iM Fl-dUTP at 60^C for 30 minutes. Reactions were 
stopped with 10 )al of stop solution (95% formamide, 10 mM EDTA, 0.02% methyl violet 
dye). Samples (3 jul) were fractionated on a 15% denaturing acrylamide gel (19:1 
crossed-hnked) and the fraction of incorporated Fl-dUTP was quantitated using an 
FMBIO-100 fluorescence gel scanner (Hitachi) equipped with a 505 nm emission filter. 

DNA sequence analysis of the recombinant gene showed that it had changes in 
the polymerase domain resulting in two amino acid substitutions: an A to G change at 
nucleotide position 1394, which causes a Glu to Gly change at amino acid position 465 
(numbered according to the natural nucleic and amino acid sequences, SEQ ED NOS:153 
and 157), and another A to G change at nucleotide position 2260, which causes a Ghi to 
Arg change at amino acid position 754. Because the Ghi to Gly mutation is at a 
nonconserved position and because the Glu to Arg mutation alters an amino acid that is 
conserved in virtually all of the known Type A polymerases, the latter mutation is most 
likely the one responsible for curtailing the synthesis activity of this protein. The 
nucleotide sequence for the construct is given in SEQ ID NO:39. The enzyme encoded 
by this sequence is referred to as Taq A/G. 

b. Initial TthPol Isolation 

The DNA polymerase enzyme from the bacterial species Thermus thermophilus 
(Tth) was produced by cloning the gene for this protein into an expression vector and 
overproducing it in E, coli cells. Genomic DNA was prepared from 1 vial of dried 
Thermus thermophilus strain HB-8 from ATCC (ATCC #27634). The DNA polymerase 
gene was ampUfied by PGR using the following primers: 
5'-CACGAATTCCGAGGCGATGCTTCCGCTC-3' (SEQ ID NO:240) and 
5'-TCGACGTCGACTAACCCTTGGCGGAAAGCC-3' (SEQ ID NO:241), The 
resulting PGR product was digested with EcdBl and Sail restriction endonucleases and 
inserted into EcoBllSal I digested plasmid vector pTrc99G (described in Example 2C1) 
to create the plasmid pTrcTth-1. This Tth polymerase construct is missing a single 
nucleotide that was inadvertently omitted from the 5' oligonucleotide, resulting in the 
polymerase gene being out of frame. This mistake was corrected by site specific 
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mutagenesis of pTrcTth-l as described in Examples 4 and 5 using the following 
oligonucleotide: 5'-GCATCGCCTCGGAATTCATGGTC-3' (SEQ ID NO:242), to create 
the plasmid pTrcTth-2. The protein and the nucleic acid sequence encoding the protein 
are referred to as TthPol, and are listed as SEQ ID NOS:243 and 244 respectively. 

5 

c. Large Scale preparation of recombinant proteins 
The recombinant proteins were purified by the following technique which is 
derived from a Taq DNA polymerase preparation protocol (Bngelke et al. Anal. 
■ ; Biochem., 191 :396 [1990]) as follows. E. coli cells (strain JM109) containing either 
10 pTrc99 A TaqPol, pTrc99GTthPol were inoculated into 3 ml of LB containing 1 00 mg/ml 
ampicillin and grown for 16 hrs at 37°C. The entire overnight culture was inoculated into 
200 ml or 350 ml of LB containing 100 mg/ml ampicillin and grown at 37°C with 

2 vigorous shaking to an Aeoo of 0.8. ff TO (1 M stock solution) was added to a final 

ml 

^ concentration of 1 mM and growth was continued for 16 hrs at 37°C. 

m 

15 The induced cells were pelleted and the cell pellet was weighed. An equal 

volume of 2X DG buffer (100 mM Tris-HCl, pH 7.6, 0.1 mM EDTA) was added and the 
0 pellet was suspended by agitation. Fifty mg/ml lysozyme (Sigma) were added to 1 
j^l mg/ml final concentration and the cells incubated at room temperature for 15 min. 
!f Deoxycholic acid (10% sohition) was added dropwise to a final concentration of 0.2 % 
5 20 while vortexing. One volume of H2O and 1 volume of 2X DG buffer were added, and the 
resulting mixture was sonicated for 2 minutes on ice to reduce the viscosity of the 
mixture. After sonication, 3 M (NH4)2S04 was added to a fmal concentration of 0.2 M, 
and the lysate was centrifuged at 14000 x g for 20 min at 4^C. The supernatant was 
removed and incubated at TO^'C for 60 min at which time 10% polyethylimine (PEI) was 
25 added to 0.25%. After incubation on ice for 30 min., the mixture was centrifuged at 

14,000 X g for 20 min at 4°C. At this point, the supematant was removed and the protein 
precipitated by the addition of (NH4)2S04 as follows. 

Two volumes of 3 M (NH4)2S04 were added to precipitate the protein. The 
mixture was incubated ovemight at room temperature for 16 hrs centrifuged at 14,000 x g 
30 for 20 min at 4°C, The protein pellet was suspended in 0.5 ml of Q buffer (50 mM 
Tris-HCl, pH 8.0, 0.1 mM EDTA, 0.1% Tween 20). For the Mja FEN-1 preparation, 
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solid (NH4)2S04 was added to a final concentration of 3 M (-75% saturated), the mixture 
was incubated on ice for 30 min, and the protein was spun down and suspended as 
described above. 

The suspended protein preparations were quantitated by determination of the Km 
dialyzed and stored in 50% glycerol, 20 mM Tris HCl, pH8.0, 50 mM KCl, 0.5% Tween 
20, 0.5% Nonidet P-40, with 100 jxg/ml BSA. 

B. DNA polymerases of Thermus flliformis and Thermus scotoductus 
1. Cloning of Thermus fittformis and Thermus scotoductus 

One vial of lyophilized Thermus flliformis (Tfi) obtained from DSMZ (Deutsche 
Sammlung von Mikroorganismen und Zellculturen, Braunschweig, Germany, strain 
#4687) was rehydrated in 1 ml of Castenholz medium (DSMZ medium 86) and 
inoculated into 500 ml of Castenholz medium preheated to 50°C. The culture was 
incubated at 70°C with vigorous shaking for 48 hours. After growth, the cells were 
harvested by centrifugation at 8000 x g for 10 minutes, the cell pellet was suspended in 
10 ml of TE (10 mM TrisHCL, pH 8.0, 1 mM EDTA), and the cells were frozen at -20°C 
in 1 ml aliquots. A 1 ml aUquot was thawed, lysozyme was added to 1 mg/ml, and the 
cells were incubated at 23°C for 30 mmutes. A solution of 20% SDS (sodium dodecyl 
sulfate) was added to a final concentration of 0.5% followed by extraction with buffered 
phenol. The aqueous phase was ftirther extracted with 1:1 phenolxhloroform, and 
extracted a final time with chloroform. One-tenth volume of 3 M sodium acetate, pH 5.0 
and 2.5 volumes of ethanol were added to the aqueous phase and mixed. The DNA was 
pelleted by centrifugation at 20,000 x g for 5 minutes. The DNA pellet was washed with 
70% ethanol, air dried and resuspended in 200 |j,l of TE ^d used directly for 
ampUfication. Thermus scotoductus (Tsc, ATCC # 51532) was grown and genomic DNA 
was prepared as described above for Thermus filiformis. 

The DNA polymerase I gene from Tfi (GenBank accession #AF030320) could not 
be amplified as a single fragment. Therefore, it was cloned in 2 separate fragments into 
the expression vector pTrc99a. The 2 fragments overlap and share a Not I site which was 
created by infroducing a silent mutation at position 1308 of the Tfi DNA polymerase open 
reading frame (ORE) in the PCR oligonucleotides. The 3' half of the gene was amplified 
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using the Advantage cDNA PGR kit (Clonetech) with the following oligonucleotides; 
5'-ATAGCCATGGTGGAGCGGCCGCTCTCCCGG (SEQ ID NO:245) and 
5'-AAGCGTCGACTCAATCCTGCTTCGCCTCCAGCC (SEQ ID NO:246). Hie PGR 
product from this reaction was approximately 1200 base pairs in length. It was cut with 
5 the restriction enzymes Not I and Sal I, and the resulting DNA was ligated into pTrc99a 
cut with NotI and Sail to create pTrc99a-Tfi3'. The 5' half of the gene was amplified as 
described above using the following two primers; 

5'AATGGAATTGAGGGGAGTTTTTGAGGTGGAGG (SEQ ID NO:247) and 
5'-CCGGGAGAGCGGCCGCTCCAC (SEQ ID NO:248). The resulting 1300 base pair 
1 0 fragment was cut with restriction enzymes Eco RI and Not I and ligated into 

pTrc99a-Tfi3' cut with Notl and ZcoRI to produce pTrc99a-TfiPol, SEQ ID NO:249 (the 

D corresponding amino acid sequence is listed in SEQ ID NO:250). 

Q 

f5 The DNA polymerase I gene from Thermus scotoductus was amplified using the 

;f Advantage cDNA PGR kit (Clonetech) using the following two primers; 

m 1 5 5'-ACTGGAATTCCTGGCGGTCTTTGAGGCGAAG (SEQ ID NO:25 1) and 

f 5'-AACAGTCGACCTAGGCCTTGGCGGAAAGCC(SEQIDNO:252). The PGR 

2 product was cut with restriction enzymes Eco RI and Sal I and ligated into Eco RI, Sal I 

rj cut pTrc99a to create pTrc99a-TscPol SEQ ID NO:253 (the corresponding amino acid 

'SJ sequence is Usted in SEQ ID NO:254). 

W 20 

2. Expression and purification of Thermus filiformis and Thermus 
scotoductus 

Plasmids were transformed into protease deficient E. coli strain BL21 (Novagen) 
or strain JM109 (Promega Corp., Madison, WI) for protein expression. Flasks containing 

25 200 ml of LB containing 100 jiig/ml ampicillin were inoculated with either a single 

colony from an LB plate or fi-om a frozen stock of the appropriate strain. After several 
hours of growth at 3TC with vigorous shaking, cxiltures was induced by the addition of 
200 ^il of 1 M isothiopropyl-galatoside (IPTG). Growth at ?>TC was continued for 16 
hours prior to harvest. Cells were pelleted by centrifiigation at 8000 x g for 15 minutes 

30 followed by suspension of the cell pellet in 5 ml of TEN (10 mM TrisHCl, pH 8.0, 1 mM 
EDTA, 100 mM NaCl). 100 \i\ of 50 mg/ml lysozyme were added and the cells 
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incubated at room temperature for 15 minutes. Deoxycholic acid (10%) was added to a 
final concentration of 0.2%. After thorough mixing, the cell lysates were sonicated for 2 
minutes on ice to reduce the viscosity of the mixture. Cellular debris was pelleted by 
centrifugation at 4°C for 15 minutes at 20,000 x g. The supernatant was removed and 
incubated at 70°C for 30 min after which 10% polyethylimine (PEI) was added to 0.25%. 
After incubation on ice for 30 minutes, the mixture was centrifuged at 20,000 x g for 20 
min at 4°C. At this point, the supernatant containing the enzyme was removed, and the 
protein was precipitated by the addition of 1 .2 g of ammonium sulfate and incubation at 
4°C for 1 hour. The protein was pelleted by centrifugation at 4°C for 10 minutes at 
20,000 X g. The pellet was resuspended in 4 ml of HPLC Buffer A (50 mM TrisHCl, pH 
8.0, 1 mM EDTA). The protein was further purified by affinity chromatography using an 
Econo-Pac heparin cartridge (Bio-Rad) and a Dionex DX 500 HPLC instrument. Briefly, 
tiie cartridge was equilibrated with HPLC Buffer A, and the enzyme extract was loaded 
on the column and eluted with a linear gradient of NaCl (0-2 M) in the same buffer. Pure 
protein elutes between 0.5 and 1 M NaCl. The enzyme peak was collected and dialyzed 
in 50% glycerol, 20 mM Tris HCl, pH 8, 50 mM KCl, 0.5% Tween 20, 0.5% Nonidet 
P40, 100 mg/ml BSA. 

C. Generation of polymerase mutants with reduced polymerase activity but 
unaltered 5' nuclease activity 

All mutants generated in section C were expressed and purified as described in 
Example 2A1C. 

1. Modified TaqPol Genes: TaqDN 

A polymerization deficient mutant of Tag DNA polymerase called TaqDN was 
constructed. TaqDN nuclease contains an asparagine residue in place of the wild-type 
aspartic acid residue at position 785 (D785N). 

DNA encoding the TaqDN nuclease was constructed firom the gene encoding the 
Taq A/G in two rounds of site-directed mutagenesis. First, the G at position 1397 and the 
G at position 2264 of the Taq A/G gene (SEQ ID NO:238) were changed to A at each 
position to recreate a wild-type TaqPol gene. In a second round of mutagenesis, the wild 
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type TaqPol gene was converted to the Taq DN gene by changing the G at position 2356 
to A. These manipulations were performed as follows. 

DNA encoding the Taq A/G nuclease was recloned from pTTQlS plasmid into 
the pTrc99A plasmid (Pharmacia) in a two-step procedure. First, the pTrc99A vector 
was modified by removing the G at position 270 of the pTrc99A map, creating the 
pTrc99G cloning vector. To this end, pTrc99A plasmid DNA was cut with Ncol and the 
recessive 3' ends were fiUed-in using the Klenow fragment of E.coli polymerase I in the 
presence of all four dNTPs at 37°C for 15 min. After inactivation of the Klenow 
fragment by incubation at 65°C for 10 min, the plasmid DNA was cut with EcoRl and the 
ends were again fiUed-in using the Klenow fragment in the presence of all four dNTPs at 
37°C for 15 min. The Klenow fragment was then inactivated by incubation at 65°C for 
10 min. The plasmid DNA was ethanol precipitated, recircularized by ligation, and used 
to transform E.coli JM109 cells (Promega). Plasmid DNA was isolated from single 
colonies, and deletion of the G at position 270 of the pTrc99A map was confirmed by 
DNA sequencing. 

In a second step, DNA encoding the Taq A/G nuclease was removed from the 
pTTQlS plasmid using EcoRl and Sail and the DNA fragment carrying the Taq A/G 
nuclease gene was separated on a 1% agarose gel and isolated with Geneclean II Kit (Bio 
101, Vista, CA). The purified fragment was ligated into the pTrc99G vector that had 
been cut with EcoRl and Sail. The ligation mixture was used to transform competent 
E.coli JM109 cells (Promega). Plasmid DNA was isolated from single colonies and 
insertion of the Taq A/G nuclease gene was confirmed by restriction analysis using 
EcoRI and Sail. 

Plasmid DNA pTrcAG carrying the Taq A/G nuclease gene cloned into the 
pTrc99A vector was purified from 200 ml of JM109 overnight culture using QIAGEN 
Plasmid Maxi kit (QIAGEN, Chatsworth, CA) according to manufacturer's protocol. 
pTrcAG plasmid DNA was mutagenized using two mutagenic primers, E465 (SEQ ID 
NO:255) (Integrated DNA Technologies, Iowa) and R754Q (SEQ ID NO:256) 
(Integrated DNA Technologies), and the selection primer Trans Oligonucleotide 
AlwNI/Spel (Clontech, Palo Alto, CA, catalog #6488-1) accordmg to TRANSFORMER 
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Site-Directed Mutagenesis Kit protocol (Clontech, Palo Alto, CA) to produce a restored 
wild-type TaqPol gene (pTrcWT). 

pTrcWT plasmid DNA carrying the wild-type TaqPol gene cloned into the 
pTrc99A vector was purified from 200 ml of JM109 overnight culture using QIAGEN 
Plasmid Maxi kit (QIAGEN, Chatsworth, CA) according to manufacturer's protocol. 
pTrcWT was then mutagenized using the mutagenic primer D785N (SEQ ID NO:257) 
(Integrated DNA Technologies) and the selection primer Switch Oligonucleotide 
Spel/AlwNI (Clontech, Palo Alto, CA, catalog #6373-1) according to TRANSFORMER 
Site-Directed Mutagenesis Kit protocol (Clontech, Palo Alto, CA) to create a plasmid 
containing DNA encoding the Taq DN nuclease. The DNA sequence encoding the Taq 
DN nuclease is provided in SEQ ID NO:258; the amino acid sequence of Taq DN 
nuclease is provided in SEQ ID NO:259. 

2. Modified TthPol Gene: Tth DN 

The Tth DN construct was created by mutating the TthPol described above. The 
sequence encoding an aspartic acid at position 787 was changed by site-specific 
mutagenesis as described above to a sequence encoding asparagine. Mutagenesis of 

pTrcTth-2 with the following oligonucleotide: 

5'-CAGGAGGAGCTCGTTGTGGACCTGGA-3' (SEQ ID NO:260) was performed to 
create the plasmid pTrcTthDN. The mutant protein and protein coding nucleic acid 
sequence is termed TthDN SEQ ID NOS:261 and 262 respectively. 

3. Taq DN HT and Tth DN HT 

Six amino acid histidine tags (his-tags) were added onto the carboxy termini of 
Taq DN and Tth DN. The site-directed mutagenesis was performed using the 
TRANSFORMER Site Directed Mutagenesis Kit (Clontech) according to the 
manufacturer's instructions. The mutagenic oligonucleotides used on the plasmids pTaq 
DN and pTth DN were sequence 1 17-067-03, 

5'-TCTAGAGGATCTATCAGTGGTGGTGGTGGTGGTGCTCCTTGGCGGAGAGC- 
3' (SEQ ID NO:263) and 

5'-TGCCTGCAGGTCGACGCTAGCTAGTGGTGGTGGTGGTGGTGACCCTTGGCG 
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GAAAGCC-3' (SEQ ID NO:264), sequence 136-037-05. The selection primer Trans 
Oligo AlwNI/Spel (Clontech, catalog # 6488-1) was used for both mutagenesis reactions. 
The resulting mutant genes were termed Taq DN HT (SEQ ID NO:265, nucleic acid 
sequence; SEQ ID NO:266, amino acid sequence) and Tth DN HT (SEQ ID NO:267, 
5 nucleic acid sequence; SEQ ID NO:268, amino acid sequence). 

4. Purification of Taq DN HT and Tth DN HT 

Both Taq DN HT and Tth DN HT proteins were expressed in E. coli strain JM109 
as described in Example 2B2. After ammonium sulfate precipitation and centrifiigation, 
10 the protein pellet was suspended in 0.5 ml of Q buffer (50 mM Tris-HCl, pH 8.0, 0.1 mM 
EDTAm 0. 1 % Tween 20). The protein was further purified by affinity chromatography 
b using His-Bind Resin and Buffer Kit (Novagen) according to the manufacturer's 
% instructions. 1 ml of His-Bind resin was transferred into a column, washed with 3 
;f column volumes of sterile water, charged with 5 volumes of IX Charge Buffer, and 
i3 15 equilibrated with 3 vokimes of IX Binding Buffer. Four ml of IX Bindmg Buffer was 
added to the protein sample and the sample solution was loaded onto the column. After 
P washing with 3 ml of IX Binding Buffer and 3 ml of IX Wash Buffer, the bound His-Tag 
fj protein was ehited with 1 ml of IX Elute Buffer. The pure enzyme was then dialyzed in 
1 50% glycerol, 20 mM Tris-HCl, pH 8.0, 50 mM KCl, 0.5% Tween 20, 0.5% Nonidet 
20 P40, and 100 ^g.ml BSA. Enzyme concentrations were determined by measuring 
absorption at 279 mn. 

EXAMPLE 3 

25 RNA-dependent 5' nuclease activity of TthPoI can be conferred on TaqPol by 
transfer of the N-terminal portion of the DNA polymerase domain 

A. Preparation and purification of substrate structures having either a DNA or 
an RNA target strand 

30 The downstream (SEQ ID NO:162) and upstream probes (SEQ ID N0:161) and 

the IL-6 DNA (SEQ ID NO: 163) (Figure 10) target strand were synthesized on a 
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PerSeptive Biosystems instrument ixsing standard phosphoramidite chemistry (Glen 
Research). The synthetic RNA-DNA chimeric L:T target labeled with biotin at the 5'-end 
(Figure 20A) was synthesized utilizing 2'-ACE RNA chemistry (Dhatmacon Research). 
The 2 -protecting groups were removed by acid-catalyzed hydrolysis according to the 
manufacturer's instructions. The downstream probes labeled with 5 '-fluorescein (Fl) or 
5'-tetrachloro-fluorescein (TET) at their 5' ends were purified by reverse phase HPLC 
using a Resource Q column (Amersham-Pharmacia Biotech). The 648-nucleotide IL-6 
RNA target (SEQ ID NO:160) (Figure 10) was synthesized by T7 RNA polymerase 
runoff-transcription of the cloned fragment of human JL-6 cDNA (nucleotides 64-691 of 
the sequence pubUshed in May et al, Proc. Natl. Acad. Sci., 83:8957 [1986]) using a 
Megascript Kit (Ambion). All oligonucleotides were finally purified by separation on a 
20% denaturing polyacrylamide gel followed by excision and elution of the major band. 
OUgonucleotide concentration was determined by measuring absorption at 260 nm. The 
biotin labeled LrT target was incubated with a 5-fold excess of streptavidin (Promega) in a 
buffer containing 10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% NP-40 and 10 |ig/ml 
tRNA at room temperature for 10 min. 

B. Introduction of restriction sites to make chimeras 

The restriction sites used for formation of chimerical proteins, described below, 
were chosen for convenience. The restriction sites in the following example have been 
strategically placed to surround regions shown by crystal structure and other analysis to 
be functional domains {See, Figures 6, 7, and 19). Different sites, either naturally 
occurring or created via directed mutagenesis can be used to make similar constructs with 
other Type A polymerase genes from related organisms. It is desirable that the mutations 
all be silent with respect to protein function. By studying the nucleic acid sequence and 
the amino acid sequence of the protein, one can introduce changes in the nucleic acid 
sequence that have no effect on the corresponding amino acid sequence. If the nucleic 
acid change required affects an amino acid, one can make the alteration such that the new 
amino acid has the same or similar characteristics of the one replaced. If neither of these 
options is possible, one can test the mutant enzyme for fimction to determine if the 
nucleic acid alteration caused a change in protein activity, specificity or function. It is 
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not intended that the invention be limited by the particular restriction sites selected or 
introduced for the creation of the improved enzymes of the present invention. 

C. Generation of Tth DN RX HT and Taq DN RX HT 

Mutagenesis was performed to introduce 3 additional, unique restriction sites into 
the polymerase domain of both the Taq DN HT and Tth DN HT enzymes. Site-specific 
mutagenesis was performed using the Transformer Site-Directed Mutagenesis Kit from 
(Clonetech) according to manufacturer's instructions. One of two different selection 
primers, Trans Oligo AlwNI/Spel or Switch Oligo Spel/AlwNI (Clontech, Palo Alto CA 
catalog #6488-1 or catalog #6373-1) was used for all mutagenesis reactions described. 
The selection oligo used in a given reaction is dependent on the selection restriction site 
present in the vector. All mutagenic primers were synthesized by standard synthetic 
chemistry. Resultant colonies were expressed in E.coli strain JM109. 

The Not I sites (amino acid position 328) were created using the mutagenic 
primers 5'-gccgccaggggcggccgcgtccaccgggcc (SEQ ID NO:269) and 
5'-gcctgcaggggcggccgcgtgcaccggggca (SEQ ID NO:270) corresponding to the sense 
strands of the Taq DN HT and the Tth DN HT genes, respectively. The BstI (amino acid 
position 382) and Ndel (amino acid position 443) sites were introduced into both genes 
using sense strand mutagenic primes 5'-ctcctggacccttcgaacaccacccc (SEQ ID NO:271) 
and 5'-gtcctggcccatatggaggccac (SEQ ID NO:272). The mutant plasmids were 
over-expressed and purified using Qiagen QiaPrep Spin Mini Prep Kit (cat. # 27106). 
The vectors were tested for the presence of the restriction sites by DNA sequencing and 
restriction mapping. These constructs are termed Tth DN RX HT (DNA sequence SEQ 
ID NO:273; amino acid sequence SEQ ID NO:274) and Taq DN RX HT (DNA sequence 
SEQ ID NO:275; amino acid sequence SEQ ID NO:276). 

D. Chimeras 

The chimeric conslxucts shown in Figure 19 were created by exchanging 
homologous DNA fragments defiined by the restriction endonuclease sites EcoRI (E) and 
BamHI (B), common for both genes, the cloning vector site Sail (S) and the new sites, 
NotI (N), BstBI (Bs), Ndel (D) created at the homologous positions of both genes by site 
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directed mutagenesis. In generating these chimeric enzymes, two different pieces of 
DNA are Ugated together to yield the final construct. The larger piece of DNA that 
contains the plasmid vector as well as part of the Taq or Tth (or parts of both) sequence 
will be termed the 'Vector." The smaller piece of DNA that contains sequences of either 
the Taq or Tth (or parts of both) polymerase will be termed the "insert." 

All restriction enzymes were fi-om New England Biolabs or Promega and used in 
reactions with the accompanying buffer, according to the manufacturer's instructions. 
Reactions were done in 20 \il volume with about 500 ng of DNA per reaction, at the 
optimal temperature for the specified enzyme. More than one enzyme was used in a 
single reaction (double digest) if the enzymes were compatible with respect to reaction 
buffer conditions and reaction temperature. If the enzymes in question were not 
compatible with respect to buffer conditions, the enzyme requiring the lowest salt 
condition was used fnst. After the completion of that reaction, buffer conditions were 
changed to be optimal or better suited to the second enzyme, and the second reaction was 
perfomied. These are common restriction enzyme digest strategies, well known to those 
in the art of basic molecular biology (Maniatis, supra). 

The digested restriction fragments were gel isolated for optimal hgation 
efficiency. Two \i\ of lOX loading dye (50% glycerol, IX TAE, 0.5% bromophenol 
blue) were added to the 20 jal reaction. The entire volume was loaded and run on a 1%, 
IX TAE agarose gel containing 1 |li1 of a 1% ethidium bromide solution per 100 ml of 
agarose gel solution. The digested fi"agments were visualized under UV Ught, and the 
appropriate fragments (as detemiined by size) were excised from the gel. These 
fragments were then purified using the Qiagen Gel Extractio Kit, (cat # 28706) according 
to the manufacturer's instructions. 

Ligations were performed in a 10 |xl volume, using 400 units per reaction of T4 
DNA Ligase enzyme from New England Biolabs (catalog #202L), with the 
accompanying reaction buffer. Ligation reactions were done at room temperature for 1 
hour, with 1 jitl of each of the Qiagen-purified fragments (approximately 20-50 ng of each 
DNA, depending on recovery from the gel isolation). Ligation products were then 
transformed into E. coli strain JM 109 and plated onto an appropriate growth and 
selection medium, such as LB with 100|ig/ml of ampicillin to select for transformants. 
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For each ligation reaction, six transformants were tested to determine if the 
desired construct was present. Plasmid DNA was purified and isolated using the QiaPrep 
Spin Mini Prep Kit, according to manufacturer's instructions. The constructs were 
verified by DNA sequencing and by restriction mapping. 

Expression and purification of the chimeric enzymes was done as follows. 
Plasmids were transformed into E. coli strain JM109 (Promega). Log phase cultures (200 
ml) of JM109 were induced with 0.5 mM IPTG (Promega) and grown for an additional 
16 hours prior to harvest. Crude extracts containing soluble proteins were prepared by 
lysis of pelleted cells in 5 ml of 10 mM Tris-HCl, pH 8.3, ImM EDTA, 0.5mg/ml 
lysozyme during incubation at room temperature for 15 minutes. The lysate was mixed 
with 5 ml of 10 mM Tris-HCl pH 7.8, 50 mM KCl, 1 mM EDTA, 0.5% Tween 20, 0.5% 
Nonidet P-40, heated at 72°C for 30 minutes, and cell debris was removed by 
centrifugation at 12,000x g for 5 minutes. Final purification of the protein was done by 
affinity chromatograpy using an Econo-Pac heparin cartridge (Bio-Rad) and Dionex DX 
500 HPLC instrument. Briefly, the cartridge was equiUbrated with 50mM Tris-HCl pH 
8, 1 mM EDTA, and an enzyme extract dialyzed against the same buffer was loaded on 
the column and eluted with a linear gradient of NaCl (0-2 M) in the same buffer. The 
HPLC-purified protein was dialyzed and stored in 50% (vol/vol) glycerol, 20 mM 
Tris-HCl pH 8.0, 50 mM KCl, 0.5% Tween 20, 0.5% Nonidet P-40, and lOOpg/m BSA. 
The enzymes were purified to homogeneity according to SDS-PAGE, and the enzyme 
concentrations were determined by measuring absorption at 279 nm. 

1. Construction of TaqTth(N) and TthTaq(N) 

The first exchange that was performed involved the polymerase domains of the 
two enzymes. Separation of the nuclease domain (the N-terminal end of the protein) 
firom the polymerase domain (the C-terminal portion of the protein) was accompUshed by 
cutting both genes with the restriction endonucleases EcoRI and Notl. The 
approximately 900 base pair fi-agment fi-om the Tth DN RX HT gene was cloned into the 
homologous sites of the Taq DN RX HT gene, and the approximately 900 base pair 
fragment firom the Taq DN RX HT gene was cloned into the homologous sites of the Tth 
DN RX HT gene, yielding two chuneras, TaqTth(N) (DNA sequence SEQ ID NO:69; 
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amino acid sequence SEQ ID N0:2) which has the Taq DN RX HT 5' nuclease domain 
and the Tth DN RX HT polymerase domain, and TthTaq(N) (DNA sequence SEQ ID 
NO:70; amino acid sequence SEQ ID N0:3) which is made up of the Tth DN RX HT 5' 
nuclease domain and the Taq DN RX HT polymerase domain. 

2. Construction of TaqTth(N-B) 

The Taq DN RX HT construct was cut with the enzymes Ndel and BamHI and 
the larger, vector fragment was gel isolated as detailed above. The Tth DN RX HT 
construct was also cut with Ndel and BamHI and the smaller (approximately 795 base 
pairs) Tth fragment was gel isolated and purified. The Tth Ndel-BamHI insert was 
Ugated into the Taq Ndel-BamHI vector as detailed above to generate the TaqTth(N-B) 
(DNA sequence SEQ ID N0:71; amino acid sequence SEQ ID N0:4). 

3. Construction of TaqTth(B-S) 

The Taq DN RX HT construct was cut with the enzymes BamHI and Sail and the 
larger vector fragment was gel isolated as detailed above. The Tth DN RX HT construct 
was also cut with BamHI and Sail and the smaller (approximately 741 base pairs) Tth 
fragment was gel isolated and purified. The Tth BamHI-Sall insert was ligated into the 
Taq BamHI-Sall vector as detailed above to generate the TaqTth(B-S) (DNA sequence 
SEQ ID NO:72; amino acid sequence SEQ ID N0:5). 

4. Construction of TaqTth(N-D) 

The Taq DN RX HT construct was cut with the enzymes NotI and Ndel and the 
larger vector fragment was isolated as detailed above. The Tth DN RX HT construct was 
also cut with NotI and Ndel and the smaller (approximately 345 base pairs) Tth fragment 
was gel isolated and purified. The Tth Notl-Ndel insert was ligated into the Taq 
Notl-Ndel vector as detailed above to generate the TaqTth(N-D) (DNA sequence SEQ ID 
NO:73; amino acid sequence SEQ ID N0:6). 

5. Construction of TaqTth(D-B) 
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The Taq DN RX HT construct was cut with the enzymes Ndel and BamHI and 
the larger vector fragment was isolated as detailed above. The Tth DN RX HT construct 
was also cut with Ndel and BamHI and the smaller (approximately 450 base pairs) Tth 
fragment was gel isolated and purified. The Tth Ndel-BamHI insert was ligated into the 
Taq Ndel-BamHI vector as detailed above to generate the TaqTth(D-B) (DNA sequence 
SEQ ID NO:74; amino acid sequence SEQ ID N0:7). 

6. Construction of TaqTth(Bs-B) 

The Taq DN RX HT construct was cut with the enzymes BstBI and BamHI and 
the larger vector fragment was isolated as detailed above. The Tth DN RX HT construct 
was also cut with BstBI and BamHI and the smaller (approximately 633 base pairs) Tth 
fragment was gel isolated and purified. The Tth Ndel-BamHI insert was ligated into the 
Taq Ndel-BamHI vector as detailed above to generate TaqTth(Bs-B) (DNA sequence 
SEQ ID NO:75; amino acid sequence SEQ ID N0:8). 

7. Construction of TaqTth(N-Bs) 

The Taq DN RX HT construct was cut with the enzymes NotI and BstBI and the 
larger vector fragment was isolated as detailed above. The Tth DN RX HT construct was 
also cut with NotI and BstBI and the smaller (approximately 162 base pairs) Tth fragment 
was gel isolated and purified. The Tth Notl-BstBI insert was ligated into the Taq 
Notl-BstBI vector as detailed above to generate TaqTth(N-Bs) (DNA sequence SEQ ID 
NO:76; amino acid sequence SEQ ID N0:9). 

8. Construction of TthTaq(B-S) 

The Tth DN RX HT construct was cut with the enzymes BamHI and Sail and the 
larger vector fragment was isolated as detailed above. The Taq DN RX HT construct was 
also cut with BamHI and SaU and the smaller (approximately 741 base pairs) Tth 
Augment was gel isolated and purified. The Taq BamHI-Sall insert was hgated into the 
Tth Bamm-Sall vector as detailed above to generate the TthTaq(B-S) (DNA sequence 
SEQ ID NO:77; amino acid sequence SEQ ID NO:10). 
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9. Construction of Tth Taq(N-B) 

The Tth DN RX HT construct was cut with the enzymes NotI and BamHI and the 
larger vector fragment was isolated as detailed above. The Taq DN RX HT construct was 
also cut with NotI and BamHI and the smaller (approximately 795 base pairs) Tth 
fragment was gel isolated and purified. The Taq Notl-BamHI insert was ligated into the 
Tth Notl-BamHI vector as detailed above to generate the TthTaq(N-B) (DNA sequence 
SEQ ID NO:78; amino acid sequence SEQ ID NO: 11). 

The cleavage activities of these chimerical proteins were characterized as describe 
in Example 1, part A, and a comparison of the cleavage cyclitig rates on an KNA target is 
shown in Figure 12. As further discussed in the Description of the Invention, these data 
show that elements found in the central third of the TthPol protein are important in 
conferring the TthPol-like RNA-dependent cleavage activity on the chimerical proteins 
comprising portions of TaqPol. 

EXAMPLE 4 

Alterations influencing RNA-dependent 5' nuclease activity do not necessarily 
influence RNA-dependent DNA polymerase activity 

TthPol is known to have a more active RNA template dependent DNA 
polymerase than does the TaqPol (Myers and Gelfand, Biochemistry 30:7661 [1991]). 
To determine whether the RNA template dependent 5' nuclease activity of the Thermus 
DNA Pol I enzymes is related to their RNA-dependent polymerase activity, the D785N 
and D787N mutations used to create the polymerase-deficient versions of TaqPol and 
TthPol, respectively were reversed. Polymerase activity was similarly restored to the 
TaqTth (N) (DNA sequence SEQ ID NO:79; amino acid sequence SEQ ID NO: 12), 
TaqTth(N-B) (DNA sequence SEQ ID NO:80; amino acid sequence SEQ ID NO: 13), 
TaqTth(B-S) (DNA sequence SEQ ID N0:81; amino acid sequence SEQ ID N0:14) 
chimeras, and the TaqPol(W417L/G418K/ E507Q) (DNA sequence SEQ ID N0:82; 
amino acid sequence SEQ ID NO: 15) mutant proteins. 
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Polymerase function was restored in all the above mentioned enzyme mutants by 
inserting the BamHI to Sail fragment of the native, non-DN sequence into the selected 
chimera or mutant enzyme. For example, the mutant construct TaqTth(N-B) was cut 
with the restriction enzyme BamHI (approximate amino acid position 593) and the 
restriction enzyme Sail (approximate amino acid position 840). The larger vector 
fragment was gel purified as described in Example 3D. The native TaqPol construct was 
also cut with the restriction endonucleases BamHI and Sail, and the smaller insert 
fragment containing the native amino acid sequence was also gel purified. The insert 
fragment was then Ugated into the vector as detailed in Experimental Example 3D. 

The polymerase activities of these proteins were evaluated by extension of the 
dT25-35-oligonucleotide primer with fluorescein-labeled dUTP in the presence of either 
poly(dA) or poly(A) template. Primer extension reactions were carried out in 10 ^1 
buffer containing 10 mM MOPS, pH7.5, 5 mM MgS04, 100 mM KCl. Forty ng of 
enzyme were used to extend 10 |aM (dT)25-30 primer in the presence of either 10 pM 
poly(A)286 or 1 \M poly(dA)273 template, 45 |liM dTTP and 5 pM Fl-dUTP at 60°C for 30 
min. Reactions were stopped with 10 pi of stop solution (95% formamide, 10 mM 
EDTA, 0.02% methyl violet dye). Samples (3 jul) were fractionated on a 15% denaturing 
acrylamide gel and the fraction of incorporated Fl-dUTP was quantitated using an 
FMBIO-100 fluorescent gel scamer (Hitachi) equipped with a 505 nm filter as described 
above. 

As shown in Figure 16, the DNA-dependent polymerase activities are very similar 
for all constructs used in this experiment, whereas the RNA-dependent polymerase 
activities of TthPol, TaqTth(N) and TaqTth(B-S) are at least 6-fold higher than the 
activities of TaqPol, TaqTth(N-B) and the TaqPol W417L/G418K/E507Q mutant. From 
the analysis of these results, it can be concluded that the high RNA-dependent DNA 
polymerase activity of TthPol is determined by the C-terminal half of the polymerase 
domain (roughly, amino acids 593-830) and that the RNA-dependent 5* nuclease and 
polymerase activities are not related to each other, and are controlled by different regions. 



EXAMPLE 5 
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Specific point mutants in Taq DN RX HT developed from information from the 

chimeric studies 

The chimeric studies (Example 3, above) suggest that the part of the TthPol 
sequence determining its high RNA-dependent 5' nuclease activity comprises the 
Bstfil-BamHI region located approximately between amino acid 382 and 593. 
Comparison of the amino acid sequences between the BstBI and BamHI regions of Tth 
DN RX HT and Taq DN RX HT (SEQ ID NOS:165 and 164, respectively) revealed only 
25 differences (Figure 13), Among these, 12 amino acid changes were conservative 
while 13 of the differences resulted in a changes in charge. Since the analysis of the 
chimeric enzymes suggested that the critical mutations are located in both the BstBI-Ndel 
and the Ndel-BamHI regions of Tth DN RX HT, site specific mutagenesis was used to 
introduce the Tth DN RX HT specific amino acids into the BstBI-Ndel and Ndel-BamHI 
regions of the TaqTth(D-B) and the TaqTth(N-D) respectively. 

Six Tth DN RX HT specific substitutions were generated in the BstBI-Ndel 
region of the TaqTth(D-B) by single or double amino acid mutagenesis. Similarly, 12 
Tth DN RX HT specific amino acid changes were introduced at the homologous position 
of the Ndel-BamHI region of the TaqTth(N-D). 

Plasmid DNA was purified from 200 ml of JM109 overnight culture using 
QIAGEN Plasmid Maxi Kit (QIAGEN, Chatsworth, CA) according to the manufacturer's 
protocol to obtain enough starting material for all mutagenesis reactions. All site specific 
mutations were introduced using the Transformer Site Directed mutagenesis Kit 
(Clontech) according to the manufacturer's protocol; specific sequence information for 
the mutagenic primers used for each site is provided below. One of two different 
selection primers, Trans Ohgo AlwNI/Spel or Switch OUgo Spel/AlwNI (Clontech, Palo 
Alto, CA catalog #6488-1 or catalog #6373-1) was used for all mutagenesis reactions 
described. The selection ohgo used in a given reaction is dependent on the restriction site 
present in the vector. All mutagenic primers were synthesized by standard synthetic 
chemistry. Resultant colonies were E.coU strain JM109. 
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1. Construction of TaqTth(D-B) E404H (DNA sequence SEQ ID NO:83; 
amino acid sequence SEQ ID NO:16) 

Site specific mutagenesis was performed on pTrc99A TaqTth(D-B) DNA using 
the mutagenic primer 240-60-01 5'-gag gag gcg ggg cac egg gcc gcc ctt-3' (SEQ E) 
5 NO:277) to introduce the E404H mutation. 

2. Construction of TaqTth(D-B) F413H/A414R (DNA sequence SEQ ID 
NO:84; amino acid sequence SEQ ID NO:17) 

Site specific mutagenesis was performed on pTrc99A TaqTth(D-B) DNA using 
10 the mutagenic primer 240-60-02 5'-ctt tec gag agg etc cat egg aac ctg tgg ggg agg-3' 
(SEQ ID NO:278) to introduce the F413H and the A4 MR mutations. 

3. Construction of TaqTtli(D-B) W417L/G418K (DNA sequence SEQ ID 
NO:85; amino acid sequence SEQ ID NO:18) 

1 5 Site specific mutagenesis was performed on pTrc99A TaqTth(D-B) DNA using 

the mutagenic primer 240-60-03 5 '-etc ttc gcc aac ctg ctt aag agg ctt gag ggg gag-3' 
(SEQ ID NO:279) to introduce the W417L and the G418K mutations. 



n 



4. Construction of TaqTth(D-B) A439R (DNA sequence SEQ ID NO:86; 
^ 20 amino acid sequence SEQ ID NO:19) 

Site specific mutagenesis was performed on pTrc99A TaqTth(ND-B) DNA using 
the mutagenic primer 240-60-04 5'-agg ccc ctt tec egg gtc ctg gcc cat-3' (SEQ ID 
NO:280) to introduce the A439R mutation. 



25 5. Construction of TaqTth(N-D) L451R (DNA sequence SEQ ID NO:87; 

amino acid sequence SEQ ID NO:20) 
Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-05 5'-acg ggg gtg cgc egg gac gtg gcc tat-3' (SEQ ID NO:281) 
to introduce the L415 mutation. 

30 
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6. Construction of TaqTth(N-D) R457Q (DNA sequence SEQ ID NO:88; 
amino acid sequence SEQ ID NO:21) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-06 5'-gtg gcc tat etc cag gcc ttg tec ctg-3' (SEQ ID NO:282) to 
5 introduce the L415Q mutation. 

7. Construction of TaqTth(N-D) V463L (DNA sequence SEQ ID NO:89; 
amino acid sequence SEQ ID NO:22) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-07 5'-ttg tec ctg gag ctt gcc gag gag atc-3' (SEQ ID NO:283) 
to introduce the V463L mutation. 

8. Construction of TaqTth(N-D) A468R (DNA sequence SEQ ID NO:90; 
amino acid sequence SEQ ID NO:23) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-08 5'-gcc gag gag ate cgc cgc etc gag gcc-3' (SEQ ID NO:284) 
to introduce the A468R mutation. 

9. Construction of TaqTth(N-D) A472E (DNA sequence SEQ ID NO:91 ; 
amino acid sequence SEQ ID NO:24) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-09 5'-gcc cgc etc gag gag gag gtc ttc cgc-3' (SEQ ID NO:285) 
to introduce the A472E mutation. 

25 10. Construction of TaqTth(N-D) G499R (DNA sequence SEQ ID NO:92; 

amino acid sequence SEQ ID NO:25) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-10 5'-ttt gac gag eta agg ctt ccc gcc atc-3' (SEQ ID NO:286) to 
introduce the G499R mutation. 

30 
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11. Construction of TaqTth(N-D) E507Q (DNA sequence SEQ ID NO:93; 
amino acid sequence SEQ ID NO:26) 

Site specific mutagenesis was perfonned on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 276-046-04 5'-atc gcc aag acg caa aag acc ggc aag-3' (SEQ ID 
NO:287) to introduce the E507Q mutation. 

12. Construction of TaqTth(N-D) Y535H (DNA sequence SEQ ID NO:94; 
amino acid sequence SEQ ID NO:27) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-11 5'-aag ate ctg cag cac egg gag etc acc-3' (SEQ ID NO:288) 
to introduce the Y535H mutation. 

13. Construction of TaqTth(N-D) S543N (DNA sequence SEQ ID NO:95; 
amino acid sequence SEQ ID NO:28) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-12 5'-acc aag ctg aag aac acc tac att gac-3' (SEQ ID NO:289) 
to introduce the S543N mutation. 

14. Construction of TaqTth(N-D) I546V (DNA sequence SEQ ID NO:96; 
amino acid sequence SEQ ID NO:29) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N[-D) DNA using the 
mutagenic primer 240-60-13 5'-aag age acc tac gtg gac ccc ttg ccg-3' (SEQ ID NO:290) 
to introduce the I546V mutation. 

15. Construction of TaqTth(N-D) D551S/I553V (DNA sequence SEQ ID 
NO:97; amino acid sequence SEQ ID NO:30) 

Site specific mutagenesis was performed on pTrc99AtaqTth(N-D) DNA using the 
mutagenic primer 240-60-14 5'-att gac ccc ttg ccg age etc gtc cac ccc agg acg ggc-3' 
(SEQ ID NO:291) to introduce the D551S and the I553V mutations. 
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16. Construction of TaqDN RX HT W417L/G418K/E507Q (DNA 
sequence SEQ ID NO:98; amino acid sequence SEQ ID NO:31) 

The TaqDN RX HT W417L/G418K/E507Q triple mutant was made by 
combining the TaqTth(D-B)W417L/G418K with the TaqTth(N-D) E507Q. 
5 TaqTth(D-B)W4 1 7L/G4 1 8K was cut with the restriction enzymes Ndel and BamHI, and 
the larger, vector fragment was isolated as detailed in Example 3. The TaqTth(N-D) 
E507Q construct was also cut with Ndel and BamHI and the smaller (approximately 795 
base pairs) fragment was gel isolated and purified as detailed in Example 3. The 
Ndel-BamHI insert was ligated into the gel purified vector, as detailed in Example 3. 

^ 10 

g 17. Construction of TaqDN RX HT W417L/E507Q (DNA sequence SEQ 

S ID NO:99; amino acid sequence SEQ ID NO:32) 

% Starting with TaqDN RX HT W417L/G418K/E507Q described above, mutagenic 

W primer 337-01-02: 5'-TTC GCC AAC CTG CTT GGG AGG CTT GAG GGG GAG -3' 
=0 1 5 (SEQ ID NO:292) was used in a site specific mutagenesis reaction to change the K at 
h amino acid position 418 back to the wild-type amino acid, G. Site specific mutagenesis 
2 was done using the Transformer Site Directed Mutagenesis Kit (Clonetech) according to 
gl the manufacturer's instructions, and as described in Experimental Example 4. 

20 18. Construction of TaqDN RX HT G418K/E507Q (DNA sequence SEQ 

ID NO:100; amino acid sequence SEQ ID NO:33) 

Starting with TaqDN RX HT W417L/G418K/E507Q described above, mutagenic 
primer 337-01-01: 5'-CTC TTC GCC AAC CTG TGG AAG AGG CTT GAG GGG -3' 
(SEQ ID NO:293) was used in a site specific mutagenesis reaction to change the L at 

25 amino acid position 417 back to the wild-type amino acid, W. Site specific mutagenesis 
was done using the Transformer Site Directed Mutagenesis Kit (Clonetech) according to 
the manufacturer's instructions, and as described in Experimental Example 4. 

Expression and purification of mutant proteins was done as detailed in Example 3, 
and the cleavage activities of these proteins were characterized as describe in Example 1, 

30 part A. A comparison of the cleavage cycling rates of a selection of these mutant 

proteins on an RNA target is shown in Figure 14. As fijrther discussed in the Description 
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of the Invention, these data show that amino acids in the regions 417/418 and amino acid 
507 are important in the conferring the TthPol-like RNA-dependent cleavage activity on 
the chimerical proteins comprising portions of TaqPol in combination with portions of 
TthPol that are not independently capable of providing enhanced RNA dependent activity 
5 (Le. , the D-B and N-D portions of Tth). As described in the Description of the Invention, 
Taq DN RX HT variant carrying only the W417L, G418K and E507Q substitutions were 
created. By comparing their cleavage rates to that of Tth DN RX HT on the IL-6 RNA 
pi substrate as described in Example 1, these mutations were determined to be sufficient to 
increase the Taq DN RX HT activity to the Tth DN RX HT level. Figure 15 shows that 
10 the Taq DN RX HT W417L/G418K/E507Q and Taq DN RX HT G418K/E507Q mutants 
hk have 1 .4 times higher activity than Tth DN RX HT and more than 4 fold higher activity 
si than Taq DN RX HT, whereas the Taq DN RX HT W417L/E507Q mutant has the same 
W activity as the enzyme, which is about 3 fold higher than Taq DN RX HT. These results 

At demonstrate that K418 and Q507 of TthPol are particularly important amino acids in 
1^ 1 5 providing RNA dependent 5' nuclease activity that is enhanced compared to TaqPol. 

0 

5 EXAMPLE 6 

ffl 

P RNA-dependent 5' nuclease properties of the Taq DN RX HT G418K/E507Q 5' 

Q 

f| nuclease are similar to Tth DN RX HT with respect to salt and temperature 

20 optima 

To determine if the G418K/E507Q mutations caused any significant changes in 
the properties of the Taq DN RX HT mutant in addition to the increased cleavage rate 
with the RNA target, the Taq DN RX HT G418K/E507Q (SEQ ID NO:33), Taq DN RX 

25 HT (SEQ ID N0276), and Tth DN RX HT (SEQ ID NO:274) enzymes were compared in 
the RNA template dependent 5* nuclease assay under conditions where temperature and 
the concentrations of salt and divalent ions were varied. The upstream DNA and the 
template RNA strands of the substrate used in this study were linked into a single IrT 
molecule (SEQ ID NO: 166) as shown in Figure 20 A, and the labeled downstream probe 

30 (SEQ ID NO: 167) was present in large excess. The 5' end of the target RNA strand was 
blocked with a biotin-streptavidin complex to prevent any non-specific degradation by 
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the enzyme during the reaction (Lyamichev et al. Science 260:778 [1993], Johnson et 
al. Science 269:238 [1995]). The cleavage rates for Taq DN RX HT G41 8K/E507Q, 
Taq DN RX HT, and Tth DN RX HT are plotted as functions of temperature in Figure 
20B. The closed circles represent enzyme Taq DN RX HT, the open circles represent 
5 enzyme Tth DN RX HT, and the Xs represent enzyme Taq DN RX HT G418K/'E507Q. 
The difference in the activities of Tth and Taq DN RX HT enzymes with the h"T substrate 
' is even greater than the difference found with the IL-6 RNA substrate when tested in a 
cleavage assay as described in Example 1. The G418K/E507Q mutations increase the 
activity of the Taq enzyme more than tenfold and by 25% compared with the Tth 
1 0 enzyme. All three enzymes show a typical temperature profile of the invasive signal 
amphfication reaction and have the same optimal temperature. No significant effect of 
p G4 1 8K/E507Q mutations on DNA dependent 5* nuclease activity of Taq DN RX HT with 

Jil the all-DNA substrate analogous to IrT (SEQ ID NO: 168) under the same conditions was 
found. 

y 15 The effects of KCl and MgS04 concentrations on the 5* nuclease activity of Taq 

f DN RX HT G41 8K/E507Q, Taq DN RX HT, and Tth DN RX HT with the IrT substrate 
Q are shown in Figure 20C and D. The activities of all enzymes have similar salt 
j4 dependencies with an optimal KCl concentration of 100 mM for Taq DN RX HT 
15 G41 8K/E507Q and Tth DN RX HT and 50 mM for Taq DN RX HT. The optimal 

n 20 MgS04 concentration for all enzymes is approximately 8 mM. The analysis of the data 
presented in Figure 20 suggests that the properties of Taq DN RX HT G418K/E507Q are 
much closer to those of Tth DN RX HT rather than Taq DN RX HT confirming the key 
role of the G418K/E507Q mutations in the recognition of the substrate with an RNA 
target. 

25 To understand the mechanism of the reduction of the 5' nuclease activity in the 

presence of an RNA versus a DNA target, the Michaelis constant (Km) and the maximal 
catalytic rate (kcat) of all three enzymes were determined, using an excess of the IrT 
substrate (SEQ ID NO:166) and the downstream probe (SEQ ID NO:167) and a limiting 
enzyme concentration. For these measurements, ten-jal reactions were assembled 

30 containing 10 mM MOPS, pH 7.5, 0.05% Tween 20, 0.05% Nonidet P-40, 10 |Lig/ml 

tRNA, 4 mM MgCla, 1 nM of enzyme (Taq DN RX HT, Tth DN RX HT, or Taq DN RX 
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HT G418K/E507Q) and different concentrations (0.125, 0.25, 0.5 or 1 |uM) of an 
equimolar mixture of the IrT target and the downstream probe. The cleavage kinetics for 
each enzyme and each substrate concentration were measured at 46°C. Reactions were 
stopped by the addition of 10 jiil of 95% formamide containing 10 roM EDTA and 0.02% 
methyl violet (Sigma). One |il of each stopped reaction digest was fractionated on a 20% 
denaturing acrylamide gel (19:1 cross-linked), with 7M urea, and in a buffer of 45 mM 
Tris-borate, pH 8.3, 1 .4mM EDTA. Gels were scanned on an FMBIO-1 00 fluorescent 
gel scanner (Hitachi) using a 585 nm filter. The fraction of cleaved product (determined 
from intensities of bands corresponding to uncut and cut substrate with FMBIO Analysis 
software, version 6.0, Hitachi) was plotted as a function of reaction time. The initial 
cleavage rates were determined from the slopes of linear part of the cleavage kinetics and 
were defined as the concentration of cut product divided by the enzyme concentration 
and the time of the reaction (in minutes). The Michaelis constant J^^m and the maximal 
catalytic rate A^at of each enzyme with frT substrate were determined from the plots of the 
initial cleavage rate as functions of the substrate concentration. 

It was found that all three enzymes have similar Km values (in the range of 
200-300 nM) and kcat values of approximately 4 min"^ for Taq DN RX HT and Tth DN 
RX HT and of 9 min"^ for Taq DN RX HT G418K/E507Q. That the G418K/E507Q 
mutations increase the kcat of Taq DN RX HT more than two fold, but have little effect on 
Km suggest that the mutations position the substrate in an orientation more appropriate for 
cleavage, rather than shnply increase the binding constant. 

EXAMPLE 7 

Use of molecular modeling to further improve RNA-dependent 5' nuclease activity 

A. Point mutants 

To develop enzymes with altered function, sequence changes were introduced by 
site-specific mutagenesis in predetermined locations or by random mutagenesis. 
Locations for site-specific mutagenesis were chosen based on evidence fi:om chimeric 
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studies, relevant published literature, and molecular modeling. Seven additional mutant 
enzymes were developed from the Tth DN RX HT enzyme, and twenty additional mutant 
enzymes were developed from the Taq DN RX HT enzyme, both discussed previously. 
Some of the mutant enzymes are the result of multiple mutagenesis reactions, that is, 
more than one change has been introduced to obtain the final product. Mutation reactions 
were done using the Tth DN RX HT construct (SEQ ED NO:273) described in Example 
2C2, or the Taq DN RX HT construct (SEQ ID NO:275), described in Example 2C1 
unless otherwise stated. Plasmid DNA was purified from 200 ml of JM109 overnight 
culture using QIAGEN Plasmid Maxi Kit (QIAGEN, Chatsworth, C A) according to the 
manufacturer's protocol to obtain enough starting material for all mutagenesis reactions. 
All site-specific mutations were introduced using the Transformer Site Directed 
mutagenesis Kit (Clontech) according to the manufacturer's protocol. One of two 
different selection primers, Trans Ohgo AlwNI/Spel or Switch Oligo Spel/AlwNI 
(Clontech, Palo Alto CA catalog #6488-1 or catalog #6373-1) was used for all 
mutagenesis reactions described. The selection oligo used in a given reaction is 
dependent on the restriction site present in the vector. All mutagenic primers for both the 
site-specific mutagenesis and the random mutagenesis were synthesized by standard 
synthetic chemistry. Resultant colonies for both types of reactions were Exoli strain 
JM109. Random mutagenesis methods are described below. 

Mutants were tested via the rapid screening protocol detailed in Example 1. 
Then, if more detailed analysis was desired, or if a larger protein preparation was 
required, expression and purification of mutant proteins was done as detailed in Example 
3. 

1. Construction of Tth DN RX HT H641 A, Tth DN RX HT H748A, Tth 
DNRXHT H786A 

Site specific mutagenesis was performed on pTrc99A Tth DN RX HT DNA using 
the mutagenic primer 583-001-02: 5'-gct tgc ggt ctg ggt ggc gat gtc ctt ccc ctc-3' (SEQ 
ID NO:294) to introduce the H641A mutation (DNA sequence SEQ ID NO: 101; amino 
acid sequence SEQ ID NO:34), or the mutagenic primer 583-001-03: 5' cat gtt gaa ggc 
cat ggc etc cgc ggc etc cct-3' (SEQ ED NO:295) to generate the H748A mutant (DNA 
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sequence SEQ ID NO: 102; amino acid sequence SEQ ED NO:35), or the mutagenic 
primer 583-001-04: 5'-cag gag gag etc gtt ggc gac ctg gag gag-3' (SEQ ID NO:296) to 
generate the H786A mutant enzyme (DNA sequence SEQ ID NO: 103; amino acid 
sequence SEQ ID NO:36). 

2. Construction of Tth DN RX HT (H786A/G506K/Q509K) 
Starting with the mutant Tth DN RX HT H786A, generated above, site specific 

mutagenesis was done using the mutagenic primer 604-022-02: 5'-gga gcg ctt gcc tgt ctt 
ctt cgt ctt ctt caa ggc ggg agg cct-3' (SEQ ID NO:297) to generate this variant termed 
"TthAKK", (DNA sequence SEQ ID NO: 104; amino acid sequence SEQ ID NO:37). 

3. Construction of Taq DN RX HT (W417L/G418K/E507Q/H784A) 
Mutagenic oligonucleotide 158-029-02: 5'-gag gac cag etc gtt ggc gac ctg aag gag 

cat-3' (SEQ ID NO:298) was used in a site specific mutagenesis reaction to introduce the 
H784A mutation and generate this construct termed "Taq4M" (DNA sequence SEQ ID 
NO:105; amino acid sequence SEQ ID NO:38). 

4. Construction of Taq4M H639A, Taq4M R587A, Taq4M G504K and 
Taq4M G80E 

Site specific mutagenesis was done on the Taq4M mutant, using primer 
473-010-11: 5'-gaggggcgggacatcgccacggagaccgccagc-3' (SEQ ID NO:299) to generate 
the Taq 4M H639A mutant (DNA sequence SEQ ID NO: 106; amino acid sequence SEQ 
ID NO:39), primer 473-010-10: 5'-cag aac ate ccc gtc gcc acc ccg ctt ggg cag-3' (SEQ 
ID NO:300) to generate Taq 4M R587A (DNA sequence SEQ ID NO: 107; amino acid 
sequence SEQ ID NO:40), primer 300-081-06: 5 '-ggg ctt ccc gcc ate aag aag acg gag aag 
acc-3' (SEQ ID NO:301) to generate Taq 4M G504K (DNA sequence SEQ ID NO: 108; 
amino acid sequence SEQ ID N0:41), and primer 330-088-04: 5'-cta ggg ctt ccc gcc ate 
aag aag acg caa aag acc ggc-3' (SEQ ID NO:302) to generate the Taq 4M G80E mutant 
(DNA sequence SEQ ID NO: 109; amino acid sequence SEQ ID NO:42). 

5. Construction of Taq 4M P88E/P90E and Taq 4M L109F/A110T 
Starting with Taq 4M described above, site specific mutagenesis was done using 

primer 473-087-03: 5'-ccg ggg aaa gtc etc etc cgt etc ggc ccg gcc cgc ctt-3' (SEQ ID 
NO:303) to generate the P88E/P90E mutations (DNA sequence SEQ ID NO: 110; amino 
acid sequence SEQ ID NO:43), or primer 473-087-05: 5 '-egg gac etc gag gcg cgt gaa ccc 
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cag gag gtc cac-3' (SEQ ID NO:304) to generate the L109F/A1 lOT mutations (DNA 
sequence SEQ ID N0:11 1; amino acid sequence SEQ ID NO:44). 
6. Construction of Taq DN RX HT 

(W417L/G418K/G499R/A502K/I503L /G504K/ E507K/H784A) 

5 Two PCR reactions were performed, first using construct Taq4M (Taq 

W417L/G418K/G504K/E507Q/H784A) as a template. Using primers 158-84-01 
5'-CTCCTCCACGAGTTCGGC-3' (SEQ ID NO:305) and 535-33-02 5'-ACC GGT 
CTT CTT CGT CTT CTT CAA CTT GGG AAG CCT GAG CTC GTC AAA-3' (SEQ 
ID NO:306) a 620 base pair PCR fragment was generated. Another 510 base pair PCR 
I 1 0 product was generated using primer 535-33-01 5'-AAG ACG AAG AAG ACC GGT 
U AAG CGC TCC ACC AGC-3' (SEQ ID NO:307) and 330-06-03 5'-GTC GAC TCT 
% AGA TCA GTG GTG GTG GTG GTG GTG CTT GGC CGC CCG GCG CAT C-3' 
ffl (SEQ ID NO:308). The two PCR products overlap such that a final recombinant PCR 
m amplification was done using the outside primers 158-84-01 and 330-06-03 to yield the 
15 11 82 base pair product. The recombinant PCR product was digested with the restriction 
^2 enzymes NotI and BamHI according to the manufacturer's instructions to yield a 793 

fl base pair firagment. The parent plasmid Taq4M was also digested with the same enzymes 

:J and used as the vector for ligation. All DNA fragments were TAB agarose gel purified 

O prior to ligation. The fragment was ligated into the vector, and transformed into JM109 

20 cells, thus incorporating the mutations G499R, A502K, I503L, and E507K as well as the 
restriction endonuclease site, Agel. This construct is termed "Taq 8M" (DNA sequence 
SEQ ID N0:112; amino acid sequence SEQ ID NO:45). 



B. Random Mutagenesis 

25 Numerous enzymes with altered function were generated via random 

mutagenesis. The regions of the protein targeted for random mutagenesis were chosen 
based on molecular modeling data and from information in the literature. Different 
mutagenic primers were used to introduce mutations into different regions of the protein. 
Random mutagenesis was performed on the Taq variant Taq 4M G504K (Taq DN RX 

30 HT W417L/G418K/G504K/E507Q/H784A/) (SEQ ID NO:108) described above and 
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mutant PGR fragments generated in the mutagenesis reaction were exchanged for 
homologous regions in TaqSM (SEQ JD N0:1 12) unless otherwise stated. 

Random mutagenesis was also performed on the Tth DN RX HT H786A (SEQ JD 
NO: 103) described above. Mutant PGR fragments generated with the Tth DN RX HT 
5 H786A template were exchanged for homologous regions in the unaltered Tth DN RX 
HT H786A. 



1. Random mutants in amino acid residues 500-507 or 513-520 

The first mutagenic oligonucleotide, 535-054-01 : 5'-gga gcg ctt acc ggt ctt (ttg 
10 cgt ctt ctt gat ctt ggg aag) cct tag etc gtc aaa gag-3 ' (SEQ JD NO:309) was used in 

conjunction with 158-84-01: 5'-CTC CTC CAC GAG TTC GGC-3' (SEQ ID NO:310) to 
^ install random residues from amino acid position 500 to 507 of Taq polymerase variant 

0 Taq DN RX HT W417UG418K/G504K/E507Q/H784A (SEQ ID NO:108). This was 

BB 

'"^ accomphshed by synthesizing the primer 535-054-01 such that only 91% of the bases 
61 15 within the parenthesis are unaltered while the remaining 9% of the bases are an equal 
v|4 mixture of the other 3 nucleotides. The initial, unaltered sequence of this oligo includes 
JL^ the G499R, A502K and the Q507K changes. 

n To generate mutations in the region 500-507, primer 535-054-01 and primer 

II 

Ji| 158-84-01 were used in a PGR reaction, using the Advantage cDNA PGR kit (Glonetech) 
y 20 and Taq variant described above, as the target. This PGR fragments was then run on a 
1% TAE agarose gel, excised and purified with QIAquick Gel Extraction Kit (Qiagen, 
Valencia GA, catalog # 28706). The purified fragment was cut with NotI and Agel and 
Ugated into pTaqSM that had been linearized with NotI and Agel. JM109 E.coH cells 
(Promega) were transformed with the ligated products. Clones were tested as described 
25 below. 

The second mutagenic oligonucleotide (used in a separate reaction) 535-054-02: 
5'-caa aag acc ggt aag cgc (tec acc age gcc gcc gtc ctg gag) gcc etc cgc gag gcc cac-3' 
(SEQ ID N0:31 1) was used in conjunction with 330-06-03: 5'-GTG GAG TGT AGA 
TGA GTG GTG GTG GTG GTG GTG GTT GGG GGG GGG GGG GAT G-3' (SEQ ID 
30 NO:312) to install random residues from amino acid 513-520. The bases within the 
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parenthesis of primer 535-054-02 are also 91% wild-type and 3% each of the other 3 
nucleotides. 

To generate mutations in the region 513-520, primer 535-054-02 and primer 
535-054-02 were used in a PGR reaction with Taq DN RX HT W417L/G418K/G504K/ 
5 E507Q/H784A (SEQ ID NO: 1 08) as template, as described above. The resulting PGR 
fragment was purified as above and cut with the restriction enzymes Agel and BamHL 
The cut fragment was then ligated into the TaqSM construct, also Unearized with Agel 
and BamHL JM109£.co/2 cells were transformed with the ligated products. Glones 
were tested as described Example 1. Mutants developed from these include: 

10 

Taq DN RX HT W417L/G418K/G499R/A502K/K504N/E507K/H784A (Ml-13) (DNA 
O sequence SEQ ID NO: 113; amino acid sequence SEQ ID NO:46). 
5 TaqDNRXHTW417L/G418K/G499R/L500I/A502K/G504KyQ507H/H784A (Ml-36) 
;| (DNA sequence SEQ ID NO: 114; amino acid sequence SEQ ID NO:47). 
W 15 TaqDNRXHTW417L/G418K/G499R/A502K/I503L/G504K/E507K/T514S/H784A 

(M2-24) (DNA sequence SEQ ID NO: 1 1 5; amino acid sequence SEQ ID NO:48). 
2 Taq DN RX HT W417L/G41 8K/G499R/A502K/I503L/G504K/E507K/ V5 1 8L/H784A 
fi (M2-06) (DNA sequence SEQ ID NO: 1 16; amino acid sequence SEQ ID NO:49). 
Q 2. TthDN RX HT H786A random mutagenesis 

^ 9 20 To generate mutants in the helix-hairpin-helix region of the TthDN RX HT 

H786A (SEQ ID NO:36) enzyme, two different PGR reactions were performed using the 

H786A (SEQ ID NO: 103) mutant as a template. The two PGR products overlap such that 

a recombinant PGR reaction can be performed (Higuchi, in PGR Technology, H. A. 

Erhch, ed., Stockton Press, New York. pp61-70 [1989]). This final PGR product is then 
25 exchanged with the homologous region of the TthDN H786A mutant by using restriction 

enzyme sites located on the ends of the fragment and within the TthDN H786A sequence. 
Starting with TthDN H786A discussed above, and using primer 604-08-06: 5'-gtc 

gga ggg gtc ccc cac gag-3' (SEQ ID NO:313) and primer 390-76-08: 5'-tgt gga att gtg 

age gg (SEQ ID N0:3 14), a 620 base pair PGR fragment was generated. PGR reactions 
30 were performed using the Advantage cDNA PGR kit (Glontech) according to 

manufacturer's instructions. This PGR product includes amino acids 1-194. No 
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mutations were introduced via this reaction, however the restriction enzyme site EcoRI is 
present at the 5' end. 

Starting with TthDN RX HT H786A discussed above, and using mutagenic 
primer 604-08-05: 5'-ctc gtg ggg gac ccc tec gac aac etc (ccc ggg gtc aag ggc ate ggg 
gag aag acc gcc) etc aag ctt etc aag-3' (SEQ ID NO:315) and primer 209-74-02: 5'-gtg 
gcc tec ata tgg gcc agg ac-3' (SEQ ID NO:316) a 787 base pair PCR fragment was 
generated. PCR reactions were done as above. This Jfragment does contain random 
mutations, due to the presence of the mutagenic primer, 604-08-05. The bases within the 
parenthesis of this primer were synthesized such that 91% of the sequence is wild-type, 
while the additional 9% is evenly divided between the remaining 3 bases. 

The two PCR fragments overlap, and were combined in a recombinant PCR 
reaction. Primers 390-76-08 and 209-74-02 were added, and the Advantage cDNA PCR 
kit (Clontech) was again used according to manufacturer's instructions. A 1380 base pair 
product was generated from this reaction. 

The recombinant PCR product was cut with the restriction enzymes EcoRI and 
NotI according to the manufacturer's instructions to yield a 986 base pair fragment. 
TthDN RX HT H786A was prepared by cutting with the same enzymes. The fragment 
was then ligated into the vector, and transformed into JM109 cells. New mutants 
developed from this set of reactions include: 

TthDN RX HT H786A/P197R/K200R (DNA sequence SEQ ID N0:1 17; amino acid 
sequence SEQ ID NO:50). 

TthDN RX HT H786A/K205Y (DNA sequence SEQ ID N0:1 18; amino acid sequence 
SEQIDN0:51). 

TthDN RX HT H786A/G203R (DNA sequence SEQ ID N0:1 19; amino acid sequence 
SEQ ID NO:52). 

3. Construction of Taq DN RX HT W417L/G418K/H784A 

L109F/A110T/G499R/A502K/I503L/G504K/E507Knr514S (Taq SS) 

Starting with Taq DN RX HT 
W417L/G418K/G499R/A502K/I503L/G504K/E507K/T514S/H784A (SEQ ID N0:115) 
mutant described above, primer 473-087-05: 5 '-egg gac etc gag gcg cgt gaa ccc cag gag 
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gtc cac-3' (SEQ ID NO:317) was used in conjunction with the appropriate selection 
primer in a site specific mutagenesis reaction to incorporate the L109F and Al lOT 
mutations to generate this enzyme, termed "TaqSS" (DNA sequence SEQ JD NO: 120; 
amino acid sequence SEQ ID NO:53). 

4. Construction of Taq DN RX HT W417L/G418L/H784A 
P88E/P90E/G499R/A502K/I503L/G504K/E507K/T514S 

Starting with Taq DN RX HT 
W417L/G418K/G499R/A502K/I503L/G504K/E507K/T514S/H784A (SEQ ID N0:115) 
mutant described above, primer 473-087-03: 5'-ccg ggg aaa gtc etc etc cgt etc ggc ccg 
gcc cgc ctt-3' (SEQ ID NO:318) was used in conjunction with the appropriate selection 
primer in a site specific mutagenesis reaction to incorporate the P88E and P90E 
mutations to generate this enzyme (DNA sequence SEQ ID NO: 121; amino acid 
sequence SEQ ID NO:54). 

5. TaqSS random mutagenesis 

Random mutagenesis was used to introduce additional changes in the 
helix-hairpin-helix domam of the TaqSS mutant (SEQ ID NO: 120). The mutagenesis 
was done as described in example 9 above. In the first step, two different but overlapping 
PGR products were generated. One of the PGR products, generated with oligos 
390-76-08 (SEQ ID NO:314), and 604-08-04: 5'-gtc gga etc gtc acc ggt cag ggc-3' (SEQ 
ID NO: 3 19) incorporates the EcoRI site into the fragment, but does not incorporate any 
mutations. The second PGR product utilizes mutagenic primer 604-08-03: 5'-ctg acc ggt 
gac gag tec gac aac ctt (ccc ggg gtc aag ggc ate ggg gag aag acg gcg) agg aag ctt ctg 
gag-3' (SEQ ID NO:320) and primer 209-74-02 (SEQ ID NO:316). This fragment 
contains random point mutations, and when combined via recombinant PGR with the first 
fragment, can be cut with the restriction enzymes EcoRI and NotI, and ligated into the 
TaqSS construct, also cut with EcoRI and Not! The ligated construct was then 
transformed into JM109. Golonies were screened as described below. Enzymes 
developed from this mutagenesis include: 

TaqSS K198N (DNA sequence SEQ ID N0:1 12; amino acid sequence SEQ ID NO:55). 
TaqSS A205Q (DNA sequence SEQ ID NO: 123; amino acid sequence SEQ ID NO:56). 
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TaqSS I200M/A205G (DNA sequence SEQ ID NO: 124; amino acid sequence SEQ ID 
NO:57). 

TaqSS K203N (DNA sequence SEQ ID NO: 125; amino acid sequence SEQ ID NO:58). 
TaqSS T204P (DNA sequence SEQ ID NO: 126; amino acid sequence SEQ ID NO:59). 

6. Construction of TaqSS R677A 

To generate enzymes with sequence changes in both the arch region and in the 
polymerase region, additional specific point mutations were generated in TaqSS. Site 
specific mutagenesis was performed as described above using the oligo 473-060-10: 
5'-tag etc ctg gga gag ggc gtg ggc cga cat gcc-3' (SEQ ID NO:321) to generate the 
TaqSS R677A mutant (DNA sequence SEQ ID NO: 127; amino acid sequence SEQ ID 
NO:60). 

7. Construction of TaqTthAKK (DNA sequence SEQ ID NO:128; amino 
acid sequence SEQ ID NO:61) and TthTaqSM (DNA sequence SEQ ID NO:129; 
amino acid sequence SEQ ID NO:62) 

Chimeric mutant TaqTthAKK and TthTaqSM were generated by cutting Tth DN 
RX HT (H786A/G506K/Q509K) (SEQ ID NO: 104; here abbreviated TthAKK) or Taq 
4M G504 (SEQ ID NO:108; here abbreviated Taq 5M) with the restriction endonucleases 
EcoRI and Notl. The smaller insert fragments as well as the larger vector firagments were 
gel purified as detailed in Example 3D, and the insert fi-agments were exchanged between 
the two mutants and ligated as described in Example 3D. Screening and verification of 
the construct sequence was also done as in Example 3D. 

EXAMPLE 8 

Improvement of RNA-dependent 5' nuclease activity in other polymerases 

Information gained from the TaqPol/TthPol recombinations, mutagenesis and 
modeling, was used to make comparable mutations in additional DNA polymerases and 
examined the effects on the cleavage activities of these enzymes. The DNA polymerases 
of Thermus filiformus (TfiPol) and Thermus scotoductus (TscPol) were cloned and 
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purified as described in Example 2. The mutagenesis of these two proteins is described 
below. 

A. Construction of TfiPolDN2M 

Mutagenesis of pTrc99a-TfiPol (SEQ ID NO:249) was done using the 
5 QuikChange site-directed mutagenesis kit (Stratagene) according to the manufacturer's 
protocol. The P420K mutation was made with the following two oligonucleotides; 
5'-CTTCCAGAACCTCTTTAAACGGCTTTCCGAGAAG (SEQ ID NO:322) and 
5'-CTTCTCGGAAAGCCGTTTAAAGAGGTTCTGGAAG (SEQ ED NO:323). The 
E507Q mutation was made with the following two ohgonucleotides; 
10 5'-CCGGTGGGCCGGACGCAGAAGACGGGCAAGC (SEQ JD NO:324) and 
U 5'-GCTTGCCCGTCTTCTGCGTCCGGCCCACCGG(SEQIDNO:325). TheD785N 
Pi mutation was made with the following two ohgonucleotides; 

5'-CTCCTCCAAGTGCACAACGAGCTGGTCCTGG (SEQ ID NO:326) and 
5 5'-CCAGGACCAGCTCGTTGTGCACTTGGAGGAG (SEQ ID NO:327). The plasmid 

1 5 containing all three mutations is called pTrc99a-TfiPolDN2M, (DNA sequence SEQ ID 
^ NO: 130; amino acid sequence SEQ ID NO:63). 

m B. Construction of TscPolDN2M 

}| Mutagenesis of pTrc99a-TscPol (SEQ ID NO:253) was done with the 

O QuikChange site-directed mutagenesis kit (Stratagene) according to the manufacturer's 

11 

20 protocol. The E416K mutation was made with the following two ohgonucleotides; 
5'-GCCGCCCTCCTGAAGCGGCTTAAGGG (SEQ ID NO:328) and 
5'-CCCTTAAGCCGCTTCAGGAGGGCGGC (SEQ ID NO:329). The E505Q mutation 
was made with the following two ohgonucleotides; 
5'-ATCGGCAAGACGCAGAAGACGGGCAAGC (SEQ ID NO:330) and 

25 5'-GCTTGCCCGTCTTCTGCGTCTTGCCGAT (SEQ ID NO:33 1). The D783N 
mutation was made with the following two oligonucleotides; 
5'-TTGCAGGTGCACAACGAACTGGTCCTC (SEQ ID NO:332) and 
5'-GAGGACCAGTTCGTTGTGCACCTGCAA (SEQ ID NO:333). The plasmid 
containing all three mutations is called pTrc99a-TscPoIDN2M, (DNA sequence SEQ ID 

30 N0:131; amino acid sequence SEQ ID NO:64). 

C. Chimerics of Tsc, Tfi, Tth and Taq mutants 
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1. Construction of TfiTth AKK (DNA sequence SEQ ID NO:132; amino 
acid sequence SEQ ID NO:65), TscTthAKK (DNA sequence SEQ ID 
NO:133; amino acid sequence SEQ ID NO:66), TfiTaqSM (DNA 
sequence SEQ ID NO:134; amino acid sequence SEQ ID NO:67) and 
5 TscTaqSM (DNA sequence SEQ ID NO: 135; amino acid sequence 

SEQ ID NO:68) 

To generate chimeric enzymes between Tth DN RX HT (H86A/G506KyQ509K) 
(here abbreviated TthAKK, SEQ ID NO: 104) or Taq 4M G504 (here abbreviated Taq 
5M, SEQ ID NO:108), and Tfi DN 2M (SEQ ID NO:130), or Tsc DN 2M (SEQ ID 
1 0 NO : 1 3 1 ), additional restriction endonuclease sites were introduced by site specific 

mutagenesis into the named Tfi and Tsc mutants. Mutagenic primers 700-01 1-01 5'-cag 
h acc atg aat tec ace cca ctt ttt gac ctg gag-3' (SEQ ID NO:334) and 700-01 1-02 5'-gtg gac 
m g^g g^^ ^g^ gg^ ^g^ ^g^ ^^g gg^ cag-3' (SEQ ID NO:335) were used to introduce an 
;|2 EcoRI site at amino acid position 1 and a NotI site at amino acid position 33 1 in Tfi DN 
y 15 2M. Mutagenic primers 700-01 1-03 5'-cag acc atg aat tec ctg ccc etc ttt gag ccc aag-3' 
^ (SEQ ID NO:336) and 700-01 1-04 5'-gta aac cgc gcc gcc cca ggc ggc ggc caa ggc gtt-3' 

Is 

O (SEQ ID NO:337) were used to introduce an EcoRI site at amino acid position 1 and a 
f| NotI site at amino acid position 327 in Tsc DN 2M. PCR reactions were done using the 
2 Advance cDNA PCR kit (Clonetech) according to manufacturer's instructions and either 
111 20 Tfi DN 2M or Tsc DN 2M as target, with their corresponding primers. The 1017 base 
pair PCR products were cut with both EcoRI and NotI to yield 993 base pair insert 
fragments that were gel purified as described in Example 3D. The mutants Taq4M 
G504K (SEQ ID NO: 108) and Tth DN RX HT (H786A/G506K/Q509K) (SEQ ID 
NO: 104) were also cut with EcoRI and NotI, and the larger, vector fragment was gel 
25 isolated as above. Ligations were performed as detailed in Example 3D, as was the 
screening and verification of the new constructs. 

EXAMPLE 9 

Additional Enzymes having Improved RNA-dependent 5* nuclease activity 

30 
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Generation of Tfi DN 2M(AN) 

To facilitate later cloning steps, an endogenous restriction enzyme site (Not I) was 
removed from the polymerase region of the TfiPolDN2M gene (SEQ ID NO: 130 
described in Example 8 A), and a unique Not I site was inserted in a more advantageous 
5 position. 

The endogeneous Not I site was removed as follows. The QUIKCHANGE 
Site-Directed Mutagenesis Kit from (Stratagene) was used according to manufacturer's 
instructions with the mutagenic primers S'-gag-gtg-gag-cgg-ccc-ctc-tcc-cgg-gtc-ttg (SEQ 
ID NO: 338) and 5'-caa-gac-ccg-gga-gag-ggg-ccg-ctc-cac-ctc (SEQ ID NO: 339). The 
10 new construct was named Tfi DN 2M(AN) (DNA sequence SEQ ID NO: 340; amino acid 
sequence SEQ ID NO: 341). 

i Generation of Tfi DN 2M(N), Tsc DN 2M(N) 

y To install a unique NotI site (at amino acid position 328) in Tfi DN 2M(AN) 

15 (SEQ ID NO: 340), primers 886-088-07 (SEQ ID NO: 342) 5'-tgg-cgg-cgg-cct-cgg-gcg- 
;Sj gcc-gcg-tcc-acc-ggg-caa-ca-3* and 700-010-03 5'-ctt-ctc-tca-tcc-gcc-aaa-aca-gcc (SEQ 
S ID NO: 343) were used in a PCR reaction with Tfi DN 2M(AN) (SEQ ID NO: 340) as 
1^ template. The resulting PCR fragment was purified and cut with the restriction enzymes 
W NotI and Sail. The cut fragment was then Ugated into the TfiTthAKK (SEQ ID NO: 132, 
20 described in example 8,C) constract which was also digested with NotI and Sail. The 
new constract was termed Tfi DN 2M(N) (DNA sequence SEQ ID NO: 345; amino acid 
sequence SEQ ID NO: 346). 

To introduce a Not I restriction endonuclease site into mutant TscPol DN 2M 
25 (previously described in Example 8B above), PCR was performed with primers: 886-088- 
05 (SEQ ID NO: 344) 5'-tgg-ccg-ccg-cct-ggg-gcg-gcc-gcg-ttt-acc-ggg-cgg-ag-3' and 
700-010-03 (SEQ ID NO: 343) 5'-ctt-ctc-tca-tcc-gcc-aaa-aca-gcc-3' using TscPolDN2M 
(SEQ ID NO: 1 3 1) as template. The Notl-Sall digested fragment of the PCR product was 
then sub-cloned into NotI and Sail digested TscTthAKK vector (SEQ ID NO: 133). The 
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resulting construct was termed Tsc DN 2M(N) (DNA sequence SEQ ID NO:347; amino 
acid sequence SEQ ID NO:348). 

Generation of Tfi DN 2M(N)AKK and Tsc DN2M(N)AKK 

To generate the "AKK" set of mutations (G504K/E507K/H784A/D785N), site 
specific mutagenesis was done on the Tfi DN 2M(N) mutant (DNA sequence SEQ ID 
NO: 345), using primers 959-022-01 to -04: 5'-ggc-ctc-acc-ccg-gtg-aag-cgg-acg-aag-aag- 
acg-ggc-aag-cgc-3', 5 '-gcg-ctt-gcc-cgt-ctt-ctt-cgt-ccg-ctt-cac-cgg-ggt-gag-gcc-3', 5 '-ctc- 
ctc-ctc-caa-gtg-gcc-aac-gag-ctg-gtc-ctg-3', 5'-cag-gac-cag-ctc-gtt-ggc-cac-ttg-gag-gag- 
gag-3' (SEQ ID NO: 354-357) to generate the Tfi 2M(N)AKK: mutant (DNA sequence 
SEQ ID NO: 358; amino acid sequence SEQ ID NO: 359). This construct is termed 
"TfiAKK". 

To install the "AKK" set of mutations (G502K/E505K/H782A/D783N), into the 
TscDN 2M (N) construct, (DNA sequence SEQ ID NO: 347) primers 959-022-05 to -08: 
5 ' -ggg-ctt-ccc-gcc-atc-aag-aag-acg-aag-aag-acg-ggc-aag-cgc-3 5 ' -gcg-ctt-gcc-cgt-ctt- 
ctt-cgt-ctt-ctt-gat-ggc-ggg-aag-ccc-3\ 5'-atg-ctt-ttg-cag-gtg-gcc-aac-gaa-ctg-gtc-ctc-3', 
5'-gag-gac-cag-ttc-gtt-ggc-cac-ctg-caa-aag-cat-3' (SEQ ID NO: 360-363) were used to 
generate Tsc2M(N)AKK (DNA sequence SEQ ID NO: 364; amino acid sequence SEQ 
ID NO: 365). This construct is termed "Tsc AKK". 

Construction of point mutants by recombinant PGR 

Construction of TthAKK(P195A) and TthAKK(P195K) 

To introduce mutations at amino acid position 195 (either a P 195 A or P195K) in 
the nuclease domain of TthAKK construct, mutagenic primer 785-073-01 (P195A) 5'- 
ccc-tcc-gac-aac-ctc-gcc-ggg-gtc-aag-ggc-atc-3' (SEQ ID NO: 370) or 785-073-02 
(P195K) 5'-ccc-tcc-gac-aac-ctc-aag-ggg-gtc-aag-ggc-atc-3' (SEQ ID NO: 371) and 
primer 209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) were used in 
a PGR reaction to generate a 787 base pair fragment. Another PGR fragment was 
obtained by using the primers: 390-076-08 5'4gt-gga-att-gtg-agc-gg-3' (SEQ ID 
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NO:314) and 785-073-03 5'-gag-gtt-gtc-gga-ggg-gtc-3' (SEQ ID NO: 372) in a reaction 
with the same template. 

The two PGR fragments overlap and were combined in a recombinant PGR 
reaction. The outside primers 390-076-08 and 209-074-02 were added, and the 
5 Advantage cDNA PGR kit (Glontech) was used according to manufacturer's instructions. 
A 1380 base pair product was generated from this reaction. 

The recombinant PGR product was cut with the restriction enzymes EcoRI and 
NotI to yield a 986 base pair fragment. The TthAKK construct was prepared by cutting 
with the same enzymes. The fragment was then Ugated into the vector, and transformed 
, 10 into JMl 09 cells. New mutants developed from this set of reactions include: 

TthAKK(P195A) (DNA sequence SEQ ID NO: 373; amino acid sequence SEQ ID NO: 
5 374) and TthAKK(P195K) (DNA sequence SEQ ID NO: 375; amino acid sequence SEQ 
S ID NO: 376). 

P[ Construction of TthAKK(N417K/L418K) 

15 The same approach was used to construct TthAKK(N417K/L418K). Two 

O overlapping PCR fragments were generated by mutagenic primers: 785-73-07 5'-gag- 
51 agg-ctc-cat-cgg-aag-aag-ctt-aag-cgc-ctc-gag-3' (SEQ ID NO: 377) and 700-10-03 5'-ctt- 
[S ctc-tca-tcc-gcc-aaa-aca-gcc-3' (SEQ ID NO:343), and primers 158-084-01 5'-ctc-ctc-cac- 
m gag-ttc-ggc-3' (SEQ ID NO:310) and 785-73-08 5'-ccg-atg-gag-cct-ctc-cga-3' (SEQ ID 
20 NO: 378). The two products were combined and amplified with outside primers 700-10- 
03 and 158-084-01 . The recombinant PCR product was cut with the restriction enzymes 
NotI and BamHI and Ugated into the Notl/BamHI pre-cut TthAKK construct. This 
mutant was termed TthAKK(N417K/L418K) (DNA sequence SEQ ID NO: 379; amino 
acid sequence SEQ ID NO: 380), 

25 

Coostruction of additional TthAKK point mutants by site-directed mutagenesis 

Construction of TthAKK(P255L) 

Site specific mutagenesis was performed on the TthAKK construct using the 
mutagenic primer 886-049-05 and 886-049-06: 5'-gtg-cgc-acc-gac-ctc-ctc-ctg-gag-gtg- 
30 gac-ctc-3' (SEQ ID NO: 381), 5'-gag-gtc-cac-ctc-cag-gag-gag-gtc-ggt-gcg-cac-3' (SEQ 



ID NO: 382) to generate TthAKK(P255L) (DNA sequence SEQ ID NO: 383; amino acid 
sequence SEQ ID NO: 384). 
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Constructioii of TthAKK(F311Y) 

Site specific mutagenesis was performed on the TthAKK construct using the 
mutagenic primer 886-049-09 and 886-049-10: 5'-ggg-gcc-ttc-gtg-ggc-tac-gtc-ctc-tcc- 
cgc-ccc-3' (SEQ ID NO: P385), 5'-ggg-gcg-gga-gag-gac-gta-gcc-cac-gaa-ggc-ccc-3' 
(SEQ ID NO: 386) to generate TthAKK(F31 lY) (DNA sequence SEQ ID NO: 387; 
amino acid sequence SEQ ID NO: 388). 



Construction of TthAKK(N221H/R224Q) 

Site specific mutagenesis was performed on the TthAKK construct using the 
mutagenic primer 886-049-01 and 886-049-02: 5'-gaa-aac-ctc-ctc-aag-cac-ctg-gac-cag- 
gta-aag-cca-gaa-aac-3' (SEQ ID NO: 389), 5'-gtt-ttc-tgg-ctt-tac-ctg-gtc-cag-gtg-ctt-gag- 
f 15 gag-gtt-ttc-3' (SEQ ID NO: 390) to generate TthAKK(N221H/R224Q) (DNA sequence 
SEQ ID NO: 391; amino acid sequence SEQ ID NO: 392). 



IJ Construction of TthAKK(R251H) 

Q Site specific mutagenesis was performed on TthAKK construct using the 

m 

'■ 20 mutagemc pnmer 886-049-03 and 886-049-04: 5'-gag-ctc-tcc-cgg-gtg-cac-acc-gac-ctc- 
ccc-ctg-3' (SEQ ID NO: 393), 5'-cag-ggg-gag-gtc-ggt-gtg-cac-ccg-gga-gag-ctc-3' (SEQ 
ID NO: 394) to generate TthAKK(R251H) (DNA sequence SEQ ID NO: 395; amino acid 
sequence SEQ ID NO: 396). 



25 Construction of TthAKK(P255L/R251H) 

Site specific mutagenesis was performed on the TthAKK(P255L) construct using 
the mutagenic primer 886-088-01 and 886-088-02: 5'-gag-ctc-tcc-cgg-gtg-cac-acc-gac- 
ctc-ctc-ctg-3' (SEQ ED NO: 397), 5'-cag-gag-gag-gtc-ggt-gtg-cac-ccg-gga-gag-ctc-3' 
(SEQ ID NO: 398) to generate TthAKK(P255L/R251H) (DNA sequence SEQ ID NO: 
30 399; amino acid sequence SEQ ID NO: 400). 
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Construction of Tth AKK L429V (DNA sequence SEQ ID NO: 401; amino 
acid sequence SEQ ID NO: 402) 

Palm region raQdomization mutagenesis was performed on the Tth AKK construct 
using the mutagenic primer 680-79-02 5 '-etc cat egg aac etc ctt [aag cgc etc gag ggg gag 
gag aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 680-79- 
04 5'-AAG GAG GTT CCG ATG GAG-3' (SEQ ID NO: 404) to introduce the random 
mutations in the Pahn region. Brackets indicate a synthesis of 91% base shown and 3% 
all other bases. 

Construction of Tth AKK E425V (DNA sequence SEQ ID NO: 405; amino 
acid sequence SEQ ID NO: 406) 

Palm region randomization mutagenesis was performed on the Tth AKK construct 
using the mutagenic primer 680-79-02 5 '-etc cat egg aac etc ctt [aag cgc etc gag ggg gag 
gag aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 680-79- 
04 5'-AAG GAG GTT CCG ATG GAG-3' (SEQ ID NO: 404) to introduce the random 
mutations in the Pahn region. Brackets indicate a synthesis of 91% base shown and 3% 
all other bases. 

Construction of Tth AKK L422N/E425K (DNA sequence SEQ ID NO: 407; 
amino acid sequence SEQ ID NO: 408) 

Palm region randomization mutagenesis was performed on the Tth AKK construct 
using the mutagenic primer 680-79-02 5 '-etc cat egg aac etc ctt [aag cgc etc gag ggg gag 
gag aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 680-79- 
04 5'-AAG GAG GTT CCG ATG GAG-3' (SEQ ID NO: 404) to mtroduce the random 
mutations in the Palm region. Brackets indicate a synthesis of 91% base shown and 3% 
all other bases. 
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Construction of Tth AKK L422F/W430C (DNA sequence SEQ ID NO: 409; 
amino acid sequence SEQ ID NO: 410) 

Palm region randomization mutagenesis was performed on Tth AKK DNA using 
the mutagenic primer 680-79-02 5'-ctc cat egg aac etc ctt [aag cgc etc gag ggg gag gag 
aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 680-79-04 
5'-AAG GAG GTT CCG ATG GAG-3' (SEQ ID NO: 404) to introduce the random 
mutations in the Palm region. Brackets indicate a synthesis of 91% base shown and 3% 
all other bases. 

Construction of Tth AKK A504F (DNA sequence SEQ ID NO: 411; amino 
acid sequence SEQ ID NO: 412) 

Site saturation mutagenesis was performed on Tth AKK constract using the 
mutagenic primer 680-80-03 5'-cag gag ctt agg ctt ccc nnn ttg aag aag acg aag aag aca-3' 
(SEQ ID NO: 413) and reverse primer 680-80-06 5'-cct aag etc gtc aaa gag-3' (SEQ ID 
NO: 414) to introduce the random mutations in the A504 amino acid. 

Construction of Tth AKK A504V (DNA sequence SEQ ID NO: 415; amino 
acid sequence SEQ ID NO: 416) 

Site saturation mutagenesis was performed on Tth AKK construct using the 
mutagenic primer 680-80-03 5'-cag gag ctt agg ctt ccc nnn ttg aag aag acg aag aag aca-3' 
(SEQ ID NO: 413) and reverse primer 680-80-06 5'-cct aag etc gtc aaa gag-3' (SEQ ID 
NO: 414) to introduce the random mutations in the A504 amino acid. 

Construction of Tth AKK A504S (DNA sequence SEQ ID NO: 417; amino 
acid sequence SEQ ID NO: 418) 

Site saturation mutagenesis was performed on Tth AKK construct using the 
mutagenic primer 680-80-03 5'-cag gag ctt agg ctt ccc nnn ttg aag aag acg aag aag aca-3' 
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(SEQ ID NO: 413) and reverse primer 680-80-06 5'-cct aag etc gtc aaa gag-3' (SEQ ID 
NO: 414) to introduce the random mutations in the A504 amino acid. 

Construction of Tth AKK S517G (DNA sequence SEQ ID NO: 419; amino 
acid sequence SEQ ID NO: 420) 

Site saturation mutagenesis was performed on Tth AKK. construct using the 
mutagenic primer 680-80-07 5'-GGC AAG CGC TCC ACC NNN GCC GCG GTG CTG 
GAG GCC CTA CGG-3' (SEQ ED NO: 421) and reverse primer 680-80-10 5'-GGT 
GGA GCG CTT GCC-3' (SEQ ID NO: 422) to introduce the random mutations in the 
S517 amino acid. 

Construction of Tth AKK A518L (DNA sequence SEQ ID NO: 423; amino 
acid sequence SEQ ID NO: 424) 

Site saturation mutagenesis was performed on Tth AKK construct using the 
mutagenic primer 680-80-07 5'-GGC AAG CGC TCC ACC AGC NNN GCG GTG CTG 
GAG GCC CTA CGG-3' (SEQ ID NO: 425) and reverse primer 680-80-10 5'-GGT 
GGA GCG CTT GCC-3' (SEQ ID NO:422) to introduce the random mutations in the 
A518 amino acid. 

Construction of Tth AKK A518R (DNA sequence SEQ ID NO: 426; amino 
acid sequence SEQ ID NO: 427) 

Site saturation mutagenesis was performed on Tth AKK construct using the 
mutagenic primer 680-80-07 5'-GGC AAG CGC TCC ACC AGC NNN GCG GTG CTG 
GAG GCC CTA CGG-3' (SEQ ID NO: 425) and reverse primer 680-80-10 5'-ggt gga 
gcg ctt gcc-3' (SEQ ID NO: 422) to introduce the random mutations in the A5 18 amino 
acid. 
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Construction of TaqSM L451R (DNA sequence SEQ ID NO: 428; amino acid 
sequence SEQ ID NO: 429) 

Site directed mutagenesis was performed on the Taq 5M construct using the 
5 mutagenic primer 240-60-05 5 '-acg-ggg-gtg-cgc-cgg-gac-gtg-gcc-tat 3 ' (SEQ ID NO: 
430) to introduce the L451R mutation in the Taq 5M enzyme. 



Construction of Tth AKK A504K (DNA sequence SEQ ID NO: 431; amino 
acid sequence SEQ ID NO: 432) 

1 0 Site directed mutagenesis was performed on the Tth AKK construct using the 

y mutagenic primer 680-69-04 5'-ctt agg ctt ccc aag ttg aag aag acg aag aag aca-3' (SEQ 
m ID NO: 433) and reverse primer 680-69-05 5 '-tgt ctt ctt cgt ctt ctt caa ctt ggg aag cct aag- 
m 3' (SEQ ID NO: 434) to introduce the A504K mutation in the Tth AKK enzyme. 

I 

Q 1 5 Construction of Tth AKK H641 A (DNA sequence SEQ ID NO: 435; amino 

i| acid sequence SEQ ID NO: 436) 

5j Site directed mutagenesis was performed on Tth AKK construct using the 

111 mutagenic primer 680-69-08 5'-gag ggg aag gac ate gcc acc cag acc gca agc-3' (SEQ ID 
NO: 437) and reverse primer 583-01-02 5'-gct tgc ggt ctg ggt ggc gat gtc ctt ccc etc- 3' 

20 (SEQ ID NO: 438) to introduce the H641A mutation in the Tth AKK enzyme. 



Construction of Tth AKK T508P (DNA sequence SEQ ID NO: 439; amino 
acid sequence SEQ ID NO: 440) 

Site directed mutagenesis was performed on TthAKK construct using the 
25 mutagenic primer 680-70-01 5'-ccc-gcc ttg aag aag ccg aag aag aca ggc aag-3' (SEQ ID 
NO: 441) and reverse primer 680-70-02 5'-ctt gcc tgt ctt ctt egg ctt ctt caa ggc ggg-3' 
(SEQ ID NO: 442) to introduce the T508P mutation in the Tth AKK enzyme. 

Chimeras and mutations in chimeras. 
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Fusion between TthAKK enzyme and alpha-peptide 

A TthAKK-lacZ-alpha-peptide chimeric fusion was constracted to allow detection 
of mutations (including frame-shifts, deletions, insertions, etc.) which cause the inability 
5 of expression of the full-length fusion protein based on the colony blue-white screening 
(Wu et al., Nucleic Acids Research, 24:1710 [1996]). 

Site specific mutagenesis was performed on TthAKK DNA using the mutagenic 
primers 959-041-03, 5'-cac-cac-cac-cac-cac-cac-gtc-gac-tag-tgc-tag-cgt-cga-cta-gct-gca- 
ggc-atg-caa-gct-tgg-c-3' (SEQ ID NO: 477) and 959-041-04, 5'-gcc-aag-ctt-gca-tgc-ctg- 
10 cag-cta-gtc-gac-gct-agc-act-agt-cga-cgt-ggt-ggt-ggt-ggt-ggt-g-3' (SEQ ID NO: 478) to 
generate pTthAKK-L with Sail site following the 6xHis tag at the C-terminus of TthAKK 
for the insertion of lacZ alpha peptide. The alpha peptide (of 201 amino acids) was first 
IS{ PGR amphfied jfrom the pCRII-TOPO vector (Invitrogen) with primers 959-041-01, 5'- 
Sj cag-gaa-gcg-gcc-gcg-tcg-aca-tga-cca-tga-tta-cgc-caa-gc-3' (SEQ ID NO: 479) and 959- 
5 15 093-01, 5'-ggg-ccc-gcc-agg-gtc-gac-tca-ggg-cga-tgg-ccc-act-acg-tga-3' (SEQ ID NO: 
'if 480). The PGR product was then digested with restriction enzyme Sail and hgated into 
s the pTthAKK-L vector, which was also cut with the same enzyme to generate the 

f| chimeric construct TthAKK-alpha peptide (DNA sequence SEQ ID NO: 481; amino acid 

sequence SEQ ID NO: 482). The orientation of the insert was confirmed by sequencing. 

P 20 

' Construction of TthTscAKK and TthTfiAKK enzymes 

The TthAKK construct was cut with the enzymes EcoRI and NotI and the smaller 
insert fragment was gel isolated. The TscAKK or TfiAKK constructs were also cut with 
EcoRI and NotI and the larger fragment was gel isolated and purified. The Tth insert 

25 (nuclease domain) was ligated into the TscAEX and TfiAKK vectors (polymerase 

domain) to generate TthTscAKK (DNA sequence SEQ ED NO: 447; amino acid sequence 
SEQ ID NO: 448) and TthTfiAKK (DNA sequence SEQ ID NO: 449; amino acid 
sequence SEQ ID NO: 450) chimeric constructs. 



30 FT mutations of Tth polymerase to improve substrate specificity 
Construction of Taq(FT)TtliAKK 
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Site specific mutagenesis was performed on the TaqTthAKK constract using the 
mutagenic primer 473-087-05: 5'-cgg-gac-ctc-gag-gcg-cgt-gaa-ccc-cag-gag-gtc-cac-3' 
(SEQ ID NO: 483) to introduce the L107F/A108T mutations and generate 
Taq(FT)TthAKK (DNA sequence SEQ ID NO: 484; amino acid sequence SEQ ID NO: 
5 485). 

Construction of Tfi(FT)TthAKK 

Site specific mutagenesis was performed on the TfiTthAKK construct using the 
mutagenic primers 785-096-01: 5'-gtg-gac-ctt-ctg-ggc-ttt-acc-cgc-ctc-gag-gcc-ccg-3' 
(SEQ ID NO: 486) and 785-096-02: 5'-cgg-ggc-ctc-gag-gcg-ggt-aaa-gcc-cag-aag-gtc- 
10 cac-3' (SEQ ID NO: 487) to introduce the L107FA''108T mutations and generate 

Tfi(FT)TthAKK (DNA sequence SEQ ID NO: 349; amino acid sequence SEQ ID NO: 
488). 

Tfi(FT) DN 2M(N) and Tsc(FT) DN 2M(N) mutants 
y 1 5 The LI 07FA^1 08T mutations were introduced by isolating the NotI and Sail 

fi:agment of the Tfi DN 2M(N) mutant (DNA sequence SEQ ID NO: 345) and inserting it 

n 

Q into a Notl-Sall pre-digested Tfi(FT)TthAKK (DNA sequence SEQ ID NO: 349; amino 
1=1 acid sequence SEQ ID NO: 488) to yield the Tfi(FT) DN 2M(N) mutant (DNA sequence 
J SEQ ID NO: 350, amino acid sequence SEQ ID NO: 351). To add the L107F or the 

n 20 E108T mutations into this Tsc-based construct, an identical procediure was done. The 

NotI and Sail cut fragement of Tsc DN 2M(N) mutant (DNA sequence SEQ ID NO: 348) 
was inserted into Notl-Sall pre-digested Tsc(FT)TthAKK (DNA sequence SEQ ID NO: 
491) vector to yield Tsc(FT) DN 2M(N) mutant (DNA sequence SEQ ID NO: 352, amino 
acid sequence SEQ ID NO: 353). 

25 

Construction of Tfi(FT) AKK and Tsc(FT) AKK 

Startmg with the Tfi(FT)DN2M(N) construct described previously (DNA 
sequence SEQ ID NO: 350), primers (959-022-01 to -04: 5'-ggc-ctc-acc-ccg-gtg-aag- 
cgg-acg-aag-aag-acg-ggc-aag-cgc-3', 5'-gcg-ctt-gcc-cgt-ctt-ctt-cgt-ccg-ctt-cac-cgg-ggt- 
30 gag-gcc-3', 5'-ctc-ctc-ctc-caa-gtg-gcc-aac-gag-ctg-gtc-ctg-3', 5'-cag-gac-cag-ctc-gtt-ggc- 
cac-ttg-gag-gag-gag-3' (SEQ ID NO: 354-357) were used to mtroduce the "AKK" set of 
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mutations (H784A, G504K and E507K) by site specific mutagenesis. The resulting 
mutant construct is termed Tfi(FT)AKK (DNA sequence SEQ ID NO:366, amino acid 
sequence SEQ ID NO:367). 

Likewise, primers 959-022-05 to -08: 5'-ggg-ctt-ccc-gcc-atc-aag-aag-acg-aag- 
5 aag-acg-ggc-aag-cgc-3', 5'-gcg-ctt-gcc-cgt-ctt-ctt-cgt-ctt-ctt-gat-ggc-ggg-aag-ccc-3', 5'- 
atg-ctt-ttg-cag-gtg-gcc-aac-gaa-ctg-gtc-ctc-3', 5'-gag-gac-cag-ttc-gtt-ggc-cac-ctg-caa-aag- 
cat-3' (SEQ ID NO: 360-363) were used in a site specific mutagenesis reaction, with the 
Tsc(FT)DN2M(N) mutant (DNA sequence SEQ ID NO: 352) as template to generate the 
Tsc(FT)AKK mutant (DNA sequence SEQ ID NO: 368; amino acid sequence SEQ ID 
10 NO: 369). 

Construction of Tsc(FT)TthAKK 
O Site specific mutagenesis was performed on the TscTthAKK construct using the 

mutagenic primers 785-008-03 5'-ttt-acc-cgc-ctc-gag-gtg-ccg-ggc-3' (SEQ ID NO: 489) 
and reverse primer 680-21-03 5'-cgg cac etc gag gcg ggt aaa gcc caa aag gtc cac-3' (SEQ 
15 ID NO: 490) to introduce the L107F/E108T mutations and generate Tsc(FT)TthAKK 
(DNA sequence SEQ ID NO: 454; amino acid sequence SEQ ID NO: 491). 
Construction of Taq(FT)TscAKK and Taq(FT)TfiAKK enzymes 

The Taq(FT)TthAKK construct was cut with the enzymes EcoRI and NotI and the 
smaller insert fragment was gel isolated. The TscAKK or TfiAKK constructs were also 
W 20 cut with EcoRI and NotI and the larger firagment was gel isolated and piuified. The 
Taq(FT) insert (nuclease domain) was ligated into the TscAKK and TfiAKK vectors 
(polymerase domain) to generate the Taq(FT)TscAKK (DNA sequence SEQ ID NO: 
443; amino acid sequence SEQ ID NO: 444) and Taq(FT)TfiAKK (DNA sequence SEQ 
ID NO: 445; amino acid sequence SEQ ID NO: 446) chimeric constructs. 
25 Construction of TaqEFT-Tth(AKK) (DNA sequence SEQ ID NO:501 ; amino 
acidsequence SEQ ID NO: 502) 
Site specific mutagenesis was performed on Taq(FT)-Tth(AKK) DNA (SEQ ID 
NO: 484) using the mutagenic primers 436-013-08: 5'-atc-gtg-gtc-ttt-gac-gcc-gag-gcc- 
ccc-tcc-ttc-c-3' (SEQ ID NO:503) and 436-013-09 5'-gga-agg-agg-ggg-cct-cgg-cgt-caa- 
30 aga-cca-cga-t-3 ' (SEQ ID NO:504) to introduce the K70E mutation. 



is* 
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Additional mutations to improve enzyme activity of Taq(EFT)TthAKK 



Construction of TaqEFT-Tth(AKK)-A/Ml (DNA sequence SEQ ID NO:505; amino 
acid sequence SEQ ID NO: 506) 

5 Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 

NO: 501) using the mutagenic primers 1044-038-01: 5'-cag-acc-atg-aat-tcg-gag-gcg-atg- 
ctg-ccc-ctc-ttt-3' (SEQ ID NO:507) and 1044-038-02: (SEQ ID NO: 508) 
5 '-aaa-gag-ggg-cag-cat-cgc-ctc-cga-att-cat-ggt-ctg-3 ' to introduce the G4EA mutation. 

10 Construction of TaqEFT-Tth(AKK)-B/M2 (DNA sequence SEQ ID NO:509; 
amino acid sequence SEQ ID NO: 510) 

H Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 

Q NO: 501) using the mutagenic primers 1044-038-03: 5'-gcc-tac-cgc-acc-ttc-ttt-gcc-ctg- 
? aag-ggc-ctc-3 and 1044-038-04: 5'-gag-gcc-ctt-cag-ggc-aaa-gaa-ggt-gcg-gta-ggc-3' (SEQ 
01 15 ID NO: 51 1 and 512, respectively) to introduce the H29F mutation. 

Construction of TaqEFT-Tth(AKK)-C/M3 (DNA sequence SEQ ID NO:513; 

ri amino acid sequence SEQ ID NO: 514) 

m 

J:| Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 

O 20 NO: 501) using the mutagenic primers 1044-038-05: 5'-ctc-ctc-aag-gcc-ctc-aga-gag-gac- 
ggg-gac-gcg-3' and 1044-038-06: 5'-cgc-gtc-ccc-gtc-ctc-tct-gag-ggc-ctt-gag-gag-3' (SEQ 
ID NO: 515 and 516, respectively) to introduce the K57R mutation. 
Construction of TaqEFT-Tth(AKK)-D/M5 (DNA sequence SEQ ID NO:517; 
amino acid sequence SEQ ID NO:518) 
25 Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 

NO: 501) using the mutagenic primers 1044-038-09: 5'-gac-gac-gtc-ctg-gcc-acc-ctg-gcc- 
aag-aag-gcg-3' and 1044-038-10: 5'-cgc-ctt-ctt-ggc-cag-ggt-ggc-cag-gac-gtc-gtc-3' 
(SEQ ID NO: 519 and 520, respectively) to introduce the S125T mutation. 

30 Construction of TaqEFT-Tth(AKK)-E/M6 (DNA sequence SEQ ID NO:521; 
amino acid sequence SEQ ID NO: 522) 
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Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ED 
NO: 501) using the mutagaiic primers 1044-038-11: 5'-ggg-gag-aag-acg-gcg-ctc-aag-ctt- 
ctg-gag-gag-3' and 1044-038-12: 5'-ctc-ctc-cag-aag-ctt-gag-cgc-cgt-ctt-ctc-ccc-3' (SEQ 
ID NO: 523 and 524, respectively) to introduce the R206L mutation. 

Construction of TaqEFT-Tth(AKK)-F/M7 (DNA sequence SEQ ID NO:525; amino 
acid sequence SEQ ID NO: 526) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-13: 5'-gag-ccc-gac-cgg-gag-ggg-ctt- 
aag-gcc-ttt-ctg-gag-agg-3' and 1044-038-14: 5'-cct-ctc-cag-aaa-ggc-ctt-aag-ccc-ctc-ccg- 
gtc-ggg-ctc-3 (SEQ ID NO: 527 and 528, respectively) to introduce the R269G and 
R27 IK mutations. 

Construction of TaqEFT-Tth(AKK)-G/M8 (DNA sequence SEQ ID NO:529; amino 
acid sequence SEQ ID NO: 530) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AEjKl) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-15: 5'-cac-gag-ttc-ggc-ctt-ctg-gga-ggg- 
gag-aag-ccc-cgg-gag-gag-gcc-ccc-tgg-ccc-3' and 1044-038-16: 5'-ggg-cca-ggg-ggc-ctc- 
ctc-ccg-ggg-ctt-ctc-ccc-tcc-cag-aag-gcc-gaa-ctc-gtg-3' (SEQ ID NO: 531 and 532, 
respectively) to introduce the E290G, S291G, P292E, A294P, and L295R mutations. 

Construction of TaqEFT-Tth(AKIQ-H/M9 (DNA sequence SEQ ID NO:533; amino 
acid sequence SEQ ID NO: 534) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1044-038-17: 5'-ctg-gcc-ctg-gcc-gcc- tgc-agg- 
ggc-ggc-cgc-gtg-3' and 1044-038-18: 5'-cac-gcg-gcc-gcc-cct-gca-ggc-ggc-cag-ggc-cag-3' 
(SEQ ID NO: 535 and 536, respectively) to introduce the A328C mutation. 

Construction of TaqEFT-Tth(AKK)-I/M10 (DNA sequence SEQ ID NO:537; amino 
acid sequence SEQ ID NO: 538) 
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Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) DNA (SEQ ID 
NO: 501) using the mutagenic primers 1080-015-01 (SEQ ID NO:539) 5'-ggg gag aag 
acg gcg agg aag ctt ctg aag gag tgg ggg agc-3' and 1080-015-02 (SEQ ID NO: 540) 5'- 
gct ccc cca etc ctt cag aag ctt cct cgc cgt ctt etc ccc-3' to introduce the E210K mutation. 

5 

i; Construction of TaqEFT-Tth(AKIQ-Ml-9 (DNA sequence SEQ ID NO:541 ; amino 

acid sequence SEQ ID NO: 542) 

Seven independent PCR reactions were performed, using construct TaqEFT- 
Tth(AKK) (SEQ ED: 501) as a template, with the following pairs of mutagenic primers: 
10 PCR Reaction 1; 1044-038-01 5'-cag-acc-atg-aat-tcg-gag-gcg-atg-ctg-ccc-ctc-ttt-3' (SEQ 
ID NO:507) and 1044-038-04 5'-gag-gcc-ctt-cag-ggc-aaa-gaa-ggt-gcg-gta-ggc-3' (SEQ 
U ID NO:512) to yield a 108 base pair fragment, PCR Reaction 2; 1044-038-03 5'-gcc-tac- 
y cgc-acc-ttc-ttt-gcc-ctg-aag-ggc-ctc-3' (SEQ ID NO: 511) and 1044-038-06 5'-cgc-gtc- 

01 ccc-gtc-ctc-tct-gag-ggc-ctt-gag-gag-3' (SEQ ID NO: 516) to yield a 1 17 base pair 

2 15 jfragment, PCR Reaction 3; 1044-038-05 5'-ctc-ctc-aag-gcc-ctc-aga-gag-gac-ggg-gac- 
W gcg-3' (SEQ ID NO: 515) and 1044-038-10 5'-cgc-ctt-ctt-ggc-cag-ggt-ggc-cag-gac-gtc- 
■A gtc-3' (SEQ ID NO: 520) to yield a 237 base pair fi-agment, PCR Reaction 4; 1044-038- 
% 09 5'-gac-gac-gtc-ctg-gcc-acc-ctg-gcc-aag-aag-gcg-3' (SEQ ID NO: 519) and 1044-038- 

3 12 5'-ctc-ctc-cag-aag-ctt-gag-cgc-cgt-ctt-ctc-ccc-3' (SEQ ID NO: 524) to yield a 276 base 
Q 20 pair fragment, PCR Reaction 5; 1044-038-1 1 5'-ggg-gag-aag-acg-gcg-ctc-aag-ctt-ctg- 

" gag-gag-3' (SEQ ID NO: 523) and 1044-038-14 5'-cct-ctc-cag-aaa-ggc-ctt-aag-ccc-ctc- 

ccg-gtc-ggg-ctc-3' (SEQ ID NO: 528) to yield a 228 base pair fragment, PCR Reaction 6; 
1044-038-13 5'-gag-ccc-gac-cgg-gag-ggg-ctt-aag-gcc-ttt-ctg-gag-agg-3' (SEQ ID NO: 
527) and 1044-038-16 5'-ggg-cca-ggg-ggc-ctc-ctc-ccg-ggg-ctt-ctc-ccc-tcc-cag-aag-gcc- 
25 gaa-ctc-gtg-3' (SEQ ID NO: 532) to yield a 1 13 base pair fragment, PCR Reaction 7; 
1044-038-15 5'-cac-gag-ttc-ggc-ctt-ctg-gga-ggg-gag-aag-ccc-cgg-gag-gag-gcc-ccc-tgg- 
ccc-3' (SEQ ID NO: 531) and 1044-038-18 5'-cac-gcg-gcc-gcc-cct-gca-ggc-ggc-cag-ggc- 
cag-3' (SEQ ID NO: 536) to yield a 157 base pair fragment. The seven PCR products 
overlap such that PCR ampUfication in the absence of primers yielded the appropriate 
30 1005 base pair product to infroduce the K70E, G4EA, H29F, K57R, S125T, R206L, 
R269G, R271K, E290G, S291G, P292E, A294P, L295R, and A328C mutations. The 
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product was further amplified using the outside primers: 1044-038-1 (SEQ ID NO: 507) 
and 1044-038-18 (SEQ ID NO: 536) and cloned into pPCR-Script-Amp. From this 
construct, the nuclease domain was again amplified using primers: 1044-038-1 (SEQ ID 
NO: 507) and 1044-038-18 (SEQ ED NO: 536) and digested with EcoRI and Notl. The 

5 digested PGR product was ligated into an EcoRI and Notl digested TaqEFT-Tth(AKK) 
construct and transformed into JM109 for protein expression and screening. 



Construction of TaqEFT-Tth(AKK)-Ml-10 (DNA sequence SEQ ID NO:543; amino 
acid sequence SEQ ID NO: 544) 

hAlO Seven independent PGR reactions were performed, using construct TaqEFT- 

^ Tth(AKK) (SEQ ID: 501) as a template, with tihe following pairs of mutagenic primers: 
W PGR Reaction 1 ; 1 044-038-01 5 '-cag-acc-atg-aat-tcg-gag-gcg-atg-ctg-ccc-ctc-ttt-3' (SEQ 

1^ ID NO:507) and 1044-038-04 5'-gag-gcc-ctt-cag-ggc-aaa-gaa-ggt-gcg-gta-ggc-3' (SEQ 

ID NO:512) to yield a 108 base pair fragment, PGR Reaction 2; 1044-038-03 5'-gcc-tac- 
^- 15 cgc-acc-ttc-ttt-gcc-ctg-aag-ggc-ctc-3" (SEQ ID NO: 511) and 1044-038-06 5'-cgc-gtc- 

f I ccc-gtc-ctc-tct-gag-ggc-ctt-gag-gag-3' (SEQ ID NO: 5 1 6) to yield a 1 1 7 base pair 

m 

|| firagment, PGR Reaction 3 ; 1 044-03 8-05 5 '-ctc-ctc-aag-gcc-ctc-aga-gag-gac-ggg-gac- 
0 gcg-3' (SEQ ID NO: 515) and 1044-038-10 5'-cgc-ctt-ctt-ggc-cag-ggt-ggc-cag-gac-gtc- 
' ^ gtc-3' (SEQ ID NO: 520) to yield a 237 base pair fi-agment, PGR Reaction 4; 1044-038- 
20 09 5'-gac-gac-gtc-ctg-gcc-acc-ctg-gcc-aag-aag-gcg-3' (SEQ ID NO: 519) and 1080-42-02 
5'-gc ttc cag get ccc cca etc ctt cag aag ctt gag cgc cgt ctt etc ccc-3' (SEQ ID NO: 546) to 
yield a 299 base pair firagment, PGR Reaction 5; 1080-42-01 5'-ggg gag aag acg gcg etc 
agg ctt ctg aag gag tgg ggg age ctg gaa gc-3' (SEQ ID NO: 545) and 1044-038-14 5'-cct- 
ctc-cag-aaa-ggc-ctt-aag-ccc-ctc-ccg-gtc-ggg-ctc-3' (SEQ ID NO: 528) to yield a 228 base 
25 pair fi-agment, PGR Reaction 6; 1044-038-13 5'-gag-ccc-gac-cgg-gag-ggg-ctt-aag-gcc-ttt- 
ctg-gag-agg-3' (SEQ ID NO: 527) and 1044-038-16 5'-ggg-cca-ggg-ggc-ctc-ctc-ccg-ggg- 
ctt-ctc-ccc-tcc-cag-aag-gcc-gaa-ctc-gtg-3' (SEQ ID NO: 532) to yield a 113 base pair 
fragment, PGR Reaction 7; 1044-038-15 5'-cac-gag-ttc-ggc-ctt-ctg-gga-ggg-gag-aag-ccc- 
cgg-gag-gag-gcc-ccc-tgg-ccc-3' (SEQ ID NO: 531) and 1044-038-18 5'-cac-gcg-gcc-gcc- 
30 cct-gca-ggc-ggc-cag-ggc-cag-3' (SEQ ID NO: 536) to yield a 1 57 base pair fragment. 
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The seven PCR products overlap such that PGR ampUfication in the absence of primers 
yielded flie appropriate 1005 base pair product to introduce the K70E, G4EA, H29F, 
K57R, S125T, R206L, E210K, R269G, R271K, E290G, S291G, P292E, A294P, L295R, 
and A328C mutations. The product was further amplified using the outside primers: 

5 1044-038-1 (SEQ ID NO: 507) and 1044-038-18 (SEQ ID NO: 536) and cloned into 
pPCR-Script-Amp. From this construct, the nuclease domain was again ampHfted using 
primers: 1044-038-1 (SEQ ID NO: 507) and 1044-038-18 (SEQ ID NO: 536) and 
digested with EcoRI and Notl. The digested PCR product was ligated into an EcoRI and 
NotI digested TaqEFT-Tth(AKK) construct and transformed into JM109 for enzyme 

1 0 expression and screening. 

5 K69E Mutation of FEN enzymes to further improve substrate specificity 

J Construction of Tth(K69E)AKK, Taq(K69E)TthAKK, Tfi(K69E)TthAKK and 
U 15 Tsc(K69£)TthAKK and mutants 

g Site specific mutagenesis was performed on TthAKK, TaqTthAKK, TfiTthAKK 

jjf and TscTthAKK DNAs using the mutagenic primers 5'-atc-gtg-gtc-ttt-gac-gcc-gag-gcc- 
il ccc-tcc-ttc-c-3' (SEQ ID NO: 492) and 5'-gga-agg-agg-ggg-cct-cgg-cgt-caa-aga-cca-cga- 
Q t-3 ' (SEQ ID NO: 493) to introduce the K69E mutation and to generate Tth(K69E)AKK 

20 (DNA sequence SEQ ID NO: 452; amino acid sequence SEQ ID NO: 494), 

Taq(K69E)ThAKK, (DNA sequence SEQ ID NO: 495; amino acid sequence SEQ ID 
NO: 496), Tfi(K69E)TthAKK (DNA sequence SEQ ID NO: 497; amino acid sequence 
SEQ ID NO: 498)and Tsc(K69E)TthAKK (DNA sequence SEQ ID NO: 499; amino acid 
sequence SEQ ID NO: 500) mutant enzymes. 

25 

Constniction of Tsc(167-334)TthAKK 

Two overlapping PCR fragments were generated by primers 390-76-08: 5'-tgt- 
gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-041-01: 5'-ctt-ctc-tca-tcc-gcc-aaa-aca- 
gcc-3' (SEQ ID NO: 451) with template Tth(K69E)AKK, (DNA sequence SEQ ID NO: 
30 452), and primers 1044-041-02: 5'-ctc-ctc-cac-gag-ttc-ggc-3' (SEQ ID NO: 453) and 
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209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) with template 
Tsc(K69E)TthAKK, (DNA sequence SEQ ID NO: 454). Two products were combined 
and amplified with outside primers 390-76-08 and 209-074-02. The recombinant PGR 
product was cut with the restriction enzymes EcoRI and NotI and Ugated into the vector 
5 TthAKK which was prepared by cutting with the same enzymes to yield Tsc(167- 

333)TthAKK construct (DNA sequence SEQ ID NO: 455; amino acid sequence SEQ ID 
NO: 456). 

Construction of Tsc(222-334)TthAKK 

1 0 Two overlapping PGR fragments were generated by primers 3 90-76-08 : 5 ' -tgt- 

H gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-041-03: 5'-ttc-cag-gtg-ctt-gag-gag-gtt- 
r\ ttc-cag-3' (SEQ ID NO: 457) with template Tth(K69E)AKK (DNA sequence SEQ ID 

-sar 

NO: 452), and primers 1044-041-04: 5'-ctc-ctc-aag-cac-ctg-gaa-cag-gtg-aaa-3' (SEQ ID 

m NO: 458) and 209-074-02: 5 '-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID N0:3 1 6) with 

jjf 

:n 15 template Tsc(K69E)TthAKK, (DNA sequence SEQ ID NO: 454). Two products were 

f;^ combined and amplified with outside primers 390-76-08 and 209-074-02. The 

f I recombinant PGR product was cut with the restriction enzymes EcoRI and NotI and 

ligated into the vector TthAKK which was prepared by cutting with the same enzymes to 
p yield Tsc(222-334)TthAKK construct (DNA sequence SEQ ID NO: 459; amino acid 

' " 20 sequence SEQ ID NO: 460). 

Construction of Tfi(222-334)TthAKK 

Two overlapping PGR fragments were generated by primers 390-76-08: 5'-tgt- 
gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-058-05: 5'-gtc-cag-gtt-ctt-gag-gag-gtt- 

25 ttc-cag-3' (SEQ ID NO: 461) with template Tth(K69E)AKK (DNA sequence SEQ ID 
NO: 452), and primers 1044-058-06: 5'-ctc-ctc-aag-aac-ctg-gac-cgg-gta-aag-3' (SEQ ID 
NO: 462) and 209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) with 
template Tfi(K69E)TthAKK (DNA sequence SEQ ID NO: 497). Two products were 
combined and amplified with outside primers 390-76-08 and 209-074-02. The 

30 recombinant PGR product was cut with the restriction enzymes EcoRI and NotI and 
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ligated into the vector TthAKK which was prepared by cutting with the same enzymes to 
yield Tfi(222-334)TthAKK construct (DNA sequence SEQ ID NO: 463; amino acid 
sequence SEQ JD NO: 464). 



5 Construction of Tfi(167-334)TthAKK 

Two overlapping PGR fragments were generated by primers 390-76-08 5'-tgt- 
gga-att-gtg-agc-gg-3 ' (SEQ ID N0:3 14) and 1 044-058-07 5 '-gac-gtc-ctt-cgg-ggt-gat- 
gag-gtg-gcc-3' (SEQ ID NO: 465) with template Tth(K69E)AKK, (DNA sequence SEQ 
ID NO: 452), and primers 1044-058-08 5'-atc-acc-ccg-aag-gac-gtc-cag-gag-aag-3' (SEQ 
10 ID NO: 466) and 209-074-02 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) 
with template Tfi(K69E)TthAKK, (DNA sequence SEQ ID NO: 498). Two products 
O were combined and amplified with outside primers 390-76-08 and 209-074-02. The 

o 

5l recombinant PGR product was cut with the restriction enzymes EcoRI and NotI and 
J Ugated into the vector TthAKK which was prepared by cutting with the same enzymes to 
U 15 yield Tfi(167-333)TthAKK construct (DNA sequence SEQ ID NO: 467; amino acid 
sequence SEQ ID NO: 468). 

re 

Ql Construction of Tsc(lll-334)TthAKK 

is Two overlapping PGR fragments were generated by primers 390-76-08 5 '-tgt- 

20 gga-att-gtg-agc-gg-3 ' (SEQ ID N0:3 14) and 1 044-058-01 5 '-ctc-gag-gcg-ggt-aaa-ccc- 
cag-gag-gtc-3' (SEQ ID NO: 469) with template Tth(K69E)AKK (DNA sequence SEQ 
ID NO: 452), and primers 1044-058-02 5'-ggg-ttt-acc-cgc-ctc-gag-gtg-ccc-ggc-3' (SEQ 
ID NO: 470) and 209-074-02 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) 
with template Tsc(K69E)TthAKK (DNA sequence SEQ ID NO: 499). Two products 
25 were combined and amplified with outside primers 390-76-08 and 209-074-02. The 
recombinant PCR product was cut with the restriction enzymes EcoRI and NotI and 
ligated into the vector TthAKK which was prepared by cutting witii the same enzymes to 
yield Tsc(167-333)TthAKK construct (DNA sequence SEQ ID NO: 471; amino acid 
sequence SEQ ID NO: 472). 

30 
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Construction of Tsc(l-167)TthAKK 

Two overlapping PGR fragments were generated by primers 390-76-08 5'-tgt- 
gga-att-gtg-agc-gg-3' (SEQIDNO:314) and 1044-058-03 5'-aag-cca-ctc-cgg-ggt-gat- 
cag-gta-acc-3' (SEQ ID NO: 473) with template Tsc(K69E)TthAKK (DNA sequence 
5 SEQ ID NO: 499), and primers 1044-058-04 5'-atc-acc-ccg-gag-tgg-ctt-tgg-gag-aag-3' 
(SEQ ID NO: 474) and 209-074-02 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID 
NO:316) with template Tth(K69E)AKK (DNA sequence SEQ ID NO: 452). Two 
products were combined and amplified with outside primers 390-76-08 and 209-074-02. 
The recombinant PGR product was cut with the restriction enzymes EcoRI and NotI and 
10 ligated into the vector TthAKK which was prepared by cutting with the same enzymes to 
yield Tsc(l-167)TthAKK construct (DNA sequence SEQ ID NO: 475; amino acid 
a sequence SEQ ID NO: 476). 

vv 

SSft 

^ Modification of AfiiFEN enzymes 
15 

li Construction of pAfuFEN 

I'M 

Plasmid pAfiiFENl was prepared as described [U.S. Patent application Ser. No. 
^ 09/684,938, WO 98/23774, each incorporated herein by reference in their entireties]. 

Briefly, genomic DNA was prepared from one vial (approximately 5 ml of culture) of 
20 live A. fulgidus bacteria from DSMZ (DSMZ #4304) with the DNA XTRAX kit (Gull 
Laboratories, Salt Lake City, UT) according to the manufacturer's protocol. The final 
DNA pellet was resuspended in 100 ix\ of TE (10 mM Tris HCl, pH 8.0, 1 mM EDTA). 
One microliter of the DNA solution was employed in a PCR using the ADVANTAGE 
cDNA PCR kit (Clonetech); the PCR was conducted according to manufacturer's 
25 recommendations. 

The 5^ end primer is complementary to the 5' end of the Afti FEN-1 gene except it 
has a 1 base pair substitution to create an Nco I site. The 3' end primer is complemetary 
to the 3' end of the Afii FEN-1 gene downstream from the FEN-1 ORE except it contains 
a 2 base substitution to create a Sal I site. The sequences of the 5' and 3' end primers are 
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5'-CCGTCAACATTTACCATGGGTGCGGA-3^ (SEQ ID NO:617) and 
5'-CCGCCACCTCGTAGTCGACATCCTTTTCGTG (SEQ ID NO:618), respectively. 
Cloning of the resulting fragment was as described for the PfuFENl gene, U.S. Patent 
application Ser. No. 5,994,069, incorporated herein in its entirety for all purposes, to 
5 create the plasmid pTrc99-AFFENl . For expression, the pTrcAfuHis plasmid was 
constructed by modifying pTrc99-AFFENl, by adding a histidine tail to facilitate 
purification. To add this histidine tail, standard PGR primer-directed mutagenesis 
methods were used to insert the coding sequence for six histidine residues between the 
last amino acid codon of the pTrc99-AFFENl coding region and the stop codon. The 
10 resulting plasmid was termed pTrcAfuHis. The protein was then expressed as described 
and purified by binding to a Ni++ affinity column. 

5 Construction of Afu(Y236A), A46-5 

W Two overlapping PCR fragments were generated from template AfuFEN (SEQ ID 

=|i 15 NO:556) with primers: 785-73-04 5 '-ctg-gtc-ggg-acg-gac-gcc-aat-gag-ggt-gtg-aag-3' 
L (SEQ ID NO: 547) and 700-10-03 5'-ctt-ctc-tca-tcc-gcc-aaa-aca-gcc-3' (SEQ ID 

2 NO:343), and primers: 390-076-08 5'-tgt-gga-att-gtg-agc-gg-3' (SEQ ID N0:3 14) and 
P 785-73-06 5'-gtc-cgt-ccc-gac-cag-aat-3' (SEQ ID NO: 548). The two products were 

3 combined and amplified with outside primers 700-10-03 and 390-076-08. The 

20 recombinant PCR product was cut with the restriction enzymes Ncol and Sail and ligated 
into the AfuFEN construct which was prepared by cutting with the same enzymes to yield 
the Afu(Y236A) construct (DNA sequence SEQ ID NO: 549; amino acid sequence SEQ 
ID NO: 550). 

25 Construction of Afu(Y236R), A56-9 

Two overlapping PCR fragments were generated using the template AfuFEN with 
primers: 785-73-05 5'-ctg-gtc-ggg-acg-gac-agg-aat-gag-ggt-gtg-aag-3' (SEQ ID NO: 
551) and 700-10-03 5'-ctt-ctc-tca-tcc-gcc-aaa-aca-gcc-3' (SEQ ID NO:343), and primers: 
390-076-08 5'-tgt-gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 785-73-06 5'-gtc-cgt-ccc- 
30 gac-cag-aat-3' (SEQ ID NO: 548). The two products were combined and amplified with 
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outside primers 700-10-03 and 390-076-08. The recombinant PGR product was cut with 
the restriction enzymes Ncol and Sail and hgated into the AfuFEN construct which was 
prepared by cutting with the same enzymes to yield Afii(Y236R) construct (DNA 
sequence SEQ JD NO: 552; amino acid sequence SEQ ID NO: 553). 

2. Chimeras of FEN enzymes and Thermus Polymerase derivatives 

The following enzyme constructs combine portions of the AfuFEN enzyme 
polymerase domain and the polymerase domain of Thermus ploymerases. These 
combinations were designed based on information generated by molecular modeling. 

Construction of Afu336-Tth296(AKK), ATl-1 

Two overlapping PGR fragments were generated. The first firagment was made 
using the AfuFEN construct (SEQ ID NO:556) as template with the primers: 390-076-08 
5'-tgt-gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 390-065-05 5'-gaa-cca-cct-ctc-aag- 
cgt-gg-3' (SEQ ID NO: 554). The second fragment was made using TthAKK (SEQ ID 
NO:558) as template and the primers: 700-049-01 5'-acg-ctt-gag-agg-tgg-ttc-ctg-gag- 
gag-gcc-ccc-tgg-3^ (SEQ ID NO: 555) and 390-076-09 5'-taa-tct-gta-tca-ggc-tg-3' (SEQ 
ID NO:557). The two products contain a region of sequence overlap, and were combined 
and ampUfied with outside primers 390-076-08 and 390-076-09 in a recombinant PGR 
reaction. The recombinant PGR product was cut with the restriction enzymes BspEI and 
Sail and hgated into the AfuFEN construct which was prepared by cutting with the same 
enzymes to yield Afu336-Tth296(AKK) construct (DNA sequence SEQ ID NO: 559; 
amino acid sequence SEQ ID NO: 560). 

Construction of Afu328-Tth296(AKK), AT2-3 

Two overlapping PGR fragments were generated, the first by primers: 390-076-08 
5'-tgt-gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 700-049-02 5'-ggt-tga-ctt-cag- 
agc-ttt-gag-3' (SEQ ID NO: 561) with template AfuFEN (DNA sequence SEQ ID 
NO:556), and the second by primers: 700-049-03 5'-aaa-gct-ctg-aag-tca-acc-ctg- 
gag-gag-gcc-ccc-tgg-3* (SEQ ID NO: 562) and 390-076-09 5'-taa-tct-gta-tca-ggc- 
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tg-3' (SEQ ID NO:557) with template TthAKK (DNA sequence SEQ ID 
NO:558). The two products were combined and amplified with outside primers 
390-076-08 and 390-076-09. The recombinant PGR product was cut with the 
restriction enzymes BspEI and Sail and ligated into the vector pAfuFEN which 
was prepared by cutting with the same enzymes to yield Afu336-Tth296(AKK) 
construct (DNA sequence SEQ ID NO: 563; amino acid sequence SEQ ID NO: 
564). 

Construction of Afu336-Taq5M 

The Taq5M construct (SEQ ID N0:41) was cut with the enzymes NotI and Sail 
and the smaller insert fragment was gel isolated. The Afu336-Tth296(AKK) construct 
(DNA sequence SEQ ID NO: 559) was also cut with the same restriction enzymes and 
the larger vector fragment was purified. The insert (TaqSM polymerase domain) was 
then ligated into the vector to generate the Afu336-Taq5M construct (DNA sequence 
SEQ ID NO: 565; amino acid sequence SEQ ID NO: 566). 

Construction of Afu336-TaqDN 

The TaqDNHT construct was cut with the enzymes NotI and Sail and the smaller 
insert fi'agment was gel isolated. The Afii336-Tth296(AKK) construct (DNA sequence 
SEQ ID NO: 559) was also cut with the same restriction enzymes and the larger vector 
fragment was purified. The insert (TaqDN polymerase domain) was then ligated into the 
vector to generate the Afii336-Taq5M construct (DNA sequence SEQ ID NO: 567; amino 
acid sequence SEQ ID NO: 568). 

Random Chimerization of Thermus Polymerases 

Numerous enzymes with altered fimctions were generated via random 
chimerization of Thermus polymerases based on the principal of DNA shuffling (Volkov 
and Arnold, Methods in Enzymology 328:456 [2000]). The procedure below was used to 
develop the following chimeras. 
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The genes of interest for random chimerization were PGR amplified with primers: 
390-076-08 5'4gt-gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 209-074-02: 5'-gtg-gcc- 
tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) to generate the approximately 1.4 kbp 
templates. About 2 jag of the DNA templates were mixed in equal proportion, and then 
digested with DNase I (0.33 U) in a 30 \xl reaction at 15° C for approximately 1 minute to 
generate fragments 50-200 bp in size. DNA fragments were purified in a 4 % agarose gel 
and extracted by QIAEXII gel extraction kit (QIAGEN) according to manufacturer's 
instructions. The purified firagments (10 ^1) were added to 10 |al of 2X PGR pre-mix (5- 
fold diluted, cloned Pfu buffer, 0.4 mM each dNTP, 0.06 cloned Pfu DNA 
polymerase, STRATAGENE Gat.# 600153, with accompanying buffer) for the fragment 
reassembly reaction (PGR program: 3 min 94 ""C followed by 40 cycles of 30 sec 94 ""C, 1 
min 52 °G, 2 min + 5 s/cycle 72 °G, followed by 4 min at 72 ^G). The reassembled 
products (1 |Lil of a 10-fold dilution) were then PGR ampUfied with a pair of nested 
primers: 072-090-01 5'-gag-cgg-ata-aca-att-tca-cac-agg-3' (SEQ ID NO: 569) and 189- 
082-01 5'-tgc-ccg-gtg-cac-gcg-gcc-gcc-cct-gca-ggc-3' (SEQ ID NO: 570) using the 
GLONTEGH GG melt cDNA PGR kit (Gat.# K1907-Y) according to the manufacturer's 
instructions. The purified PGR products were digested with restriction enzymes EcoRI 
and NotI and then hgated into the TthAKK construct that was prepared by cutting with 
the same enzymes. The ligation mixture was transformed into JM109 competent cells 
and colonies were screened for enzyme activity. 



1. Generation of Random Chimeras S26 and S36 

The templates used to develop these chimeric enzymes were the nuclease domains 
from TthAKK, TaqTthAKK, TscTthAKK and the TfiTthAKK constructs. Two clones 
were found to show improvement of activity based on primary activity screening. 
Random chimeras S26 (DNA sequence SEQ ID NO: 571; amino acid sequence SEQ ID 
NO: 572) and 836 (DNA sequence SEQ ID NO: 573; amino acid sequence SEQ ID NO: 
574) were then sequenced and isolated. 
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2. Introduction of L107F/E108T, LlOPFA'^llOT and K69E mutations in S26 
and S36 to improve substrate specificity 
Construction of S26(FT) 

Site specific mutagenesis was performed on pS26 DNA using the mutagenic 
5 primers: 785-008-03 5'-ttt-acc-cgc-ctc-gag-gtg-ccg-ggc-3' (SEQ ID NO: 489) and 680- 
21-03 5'-cgg-cac-ctc-gag-gcg-ggt-aaa-gcc-caa-aag-gtc-cac-3' (SEQ ID NO: 490) to 
introduce the L107F/E108T mutations and generate S26(Fr) (DNA sequence SEQ ID 
NO: 575; amino acid sequence SEQ ID NO: 576). 
Construction of S36(FT) 
10 Site specific mutagenesis was performed on pS36 DNA using the mutagenic 

primers: 785-096-01: 5'-gtg-gac-ctt-ctg-ggc-ttt-acc-cgc-ctc-gag-gcc-ccg-3' (SEQ ID NO: 
486) and 785-096-02: 5 '-cgg-ggc-ctc-gag-gcg-ggt-aaa-gcc-cag-aag-gtc-cac-3 ' (SEQ ID 
d NO: 487) to introduce the L109FA^1 lOT mutations and generate S36(FT) (DNA 
5 sequence SEQ ID NO: 577; amino acid sequence SEQ ID NO: 578). 
H 15 Construction of S26(K69E) 

Site specific mutagenesis was performed on S26 DNA using the mutagenic 
Q primers: 5'-atc-gtg-gtc-ttt-gac-gcc-gag-gcc-ccc-tcc-ttc-c-3' (SEQ ID NO: 492) and 5'- 
jj gga-agg-agg-ggg-cct-cgg-cgt-caa-aga-cca-cga-t-3' (SEQ ID NO: 493) to introduce the 

is K69E mutation and to generate S26(K69E) (DNA sequence SEQ ID NO: 579; amino 
£ 20 acid sequence SEQ ID NO: 580). 

Construction of S26(Fr/K69E) 

Site specific mutagenesis was performed on S26(FT) DNAs using the mutagenic 
primers: 5'-atc-gtg-gtc-ttt-gac-gcc-gag-gcc-ccc-tcc-ttc-c-3' (SEQ ID NO: 492) and 5'- 
gga-agg-agg-ggg-cct-cgg-cgt-caa-aga-cca-cga-t-3' (SEQ ID NO: 493) to introduce the 
25 K69E mutation and to generate S26(FT/K69E) (DNA sequence SEQ ID NO: 581; amino 
acid sequence SEQ ID NO: 582). 



3. More random chimeras, N3D7, N1A12 N1C4, and N2C3 
The templates used to generate tiiese chimerics were the nuclease domains of 
30 Tth(K69E)AKK, Taq(K69E)TthAKK, Tsc(K69E)TthAKK and Tfi(K69E)TthAKK 
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constructs. Four clones were shown to have improved activity based on the primary 
activity screening. Random chimeras N3D7 (DNA sequence SEQ ID NO: 583; amino 
acid sequence SEQ ID NO: 584), Nl A12 (DNA sequence SEQ ID NO: 585; amino acid 
sequence SEQ ID NO: 586), N1C4 (DNA sequence SEQ ID NO: 587; amino acid 
5 sequence SEQ ID NO: 588) and N2C3 (DNA sequence SEQ ID NO: 589; amino acid 
sequence SEQ ID NO:590) were then sequenced and isolated respectively. 
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Generation of G, L and GL variants 

10 A variant of Tth polymerase having an Asp to Gly change at amino acid 385 (D385G) 
and an Arg toLeu change at amino acid 455 (R455L), (amino acid SEQ ID NO:2712; 
gene sequence SEQ ID NO:2713) was used to create derivatives and chimerical enzymes. 
Constructs comprising these two amino acid variations are indicated by the 'GL' 
designation, below. Chimeras comprising only one of the variations are indicated as 
15 either *G' or 'L*. To create a non-polymerizing variant, the sequence encoding an aspartic 
acid at position 787 was changed by site-specific mutagenesis as described above to a 
sequence encoding asparagine. Mutagenesis of pTrcTth GL-2 with the following 
oligonucleotide: 5'-CAGGAGGAGCTCGTTGTGGACCTGGA-3' (SEQ ID NO:260) 
was performed to create the plasmid pTrcTthDN GL. The mutant protein and coding 
20 nucleic acid sequence is termed TthDN GL, SEQ ID NOS:2714 and 2715 respectively. 

A. Ttli DN HI GL 

Six amino acid histidine tags (his-tags) were added onto the carboxy termini of 
Tth DN GL, as described above for Tth DN. The resulting mutant gene is termed Tth 
25 DN HT GL (SEQ ID NO:2716, nucleic acid sequence; SEQ ID NO:2717, amino acid 
sequence). The protein was expressed and purified as described for Tth DN HT, above. 

B. Generation of Tth DN RX HT GL 

Mutagenesis was performed to introduce 3 additional, unique restriction sites into 
30 the polymerase domain of Tth DN HT enzymes. Site-specific mutagenesis was 
performed using the Transformer Site-Directed Mutagenesis Kit from (Clonetech) 



m 
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according to manufacturer's instructions. One of two different selection primers, Trans 
Oligo AlwNFSpel or Switch Oligo Spel/AlwNI (Clontech, Palo Alto CA catalog 
#6488-1 or catalog #6373-1) was used for all mutagenesis reactions described. The 
selection oligo used in a given reaction is dependent on the selection restriction site 
5 present in the vector. All mutagenic primers were synthesized by standard synthetic 
chemistry. Resultant colonies were expressed in E.coli strain JM109. 

The Not I site (amino acid position 328) was created using the mutagenic primers 
5'-gccgccaggggcggccgcgtccaccgggcc (SEQ ID NO:269) and 
5'-gcctgcaggggcggccgcgtgcaccggggca (SEQ E) NO:270) corresponding to the sense 
10 strand of the Tth DN HT GL gene. The BstI (amino acid position 382) and Ndel (amino 
acid position 443) sites were introduced into the gene using sense strand mutagenic 
p primes 5'-ctcctggacccttcgaacaccacccc (SEQ ID NO:271) and 5'-gtcctggcccatatggaggccac 

(SEQ ID NO:272). The mutant plasmid was over-expressed and purified using Qiagen 
4^ QiaPrep Spin Mini Prep Kit (cat. # 27106). The vector were tested for the presence of 
Q 15 the restriction sites by DNA sequencing and restriction mapping. This construct is 
termed Tth DN RX HT GL (DNA sequence SEQ ID NO:271 8; amino acid SEQ ID 

13 NO:2719). Chimeric constructs were created using Tth DN RX HT GL. 

11 
11. 

J 1, Constructioii of TaqTth(N) GL 

n 20 The first exchange that was performed involved the polymerase domains of the 

two enzymes. Separation of the nuclease domain (the N-terminal end of the protein) 
from the polymerase domain (the C-terminal portion of the protein) was accomplished by 
cutting both genes with the restriction endonucleases EcoRI and NotL The 
approximately 900 base pair fragment from the Taq DN RX HT gene was cloned into the 
25 homologous sites of the Tth DN RX HT GL gene, yielding the chimera TaqTth(N) GL 
(DNA sequence SEQ ED NO:2675; amino acid sequence SEQ ID NO:2641) which has 
the Taq DN RX HT 5' nuclease domain and the Tth DN RX HT GL polymerase domain. 

1. Construction of TaqTth(N-B) GL 

30 The Taq DN RX HT construct was cut with the enzymes Ndel and BamHI and 

the larger, vector fragment was gel isolated as detailed above. The Tth DN RX HT GL 
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construct was also cut with Ndel and BamHI and the smaller (approximately 795 base 
pairs) Tth fragment was gel isolated and purified. The Tth GL Ndel-BamHI insert was 
ligated into the Taq Ndel-BamHI vector as detailed above to generate the TaqTth(N-B) 
GL (DNA sequence SEQ ID NO:2676; amino acid sequence SEQ ID NO:2642). 

5 

3. Construction of TaqTth(N-D) G 

The Taq DN RX HT construct was cut with the enzymes NotI and Ndel and the 
larger vector fragment was isolated as detailed above. The Tth DN RX HT GL construct 
was also cut with NotI and Ndel and the smaller (approximately 345 base pairs) Tth 
10 fragment was gel isolated and purified. The Tth GL Notl-Ndel insert was ligated into the 
Taq Notl-Ndel vector as detailed above to generate the TaqTth(N-D) G (DNA sequence 

H SEQ ID NO:2677; amino acid sequence SEQ ID NO:2643). 

Q 

1 4. Constniction of TaqTth(D-B) L 

vl 1 5 The Taq DN RX HT construct was cut with the enzymes Ndel and BamHI and 

Jj5 the larger vector fragment was isolated as detailed above. The Tth DN RX HT GL 

1^ construct was also cut with Ndel and BamHI and the smaller (approximately 450 base 

ri pairs) Tth fragment was gel isolated and purified. The Tth GL Ndel-BamHI insert was 

Q Ugated into the Taq Ndel-BamHI vector as detailed above to generate the TaqTth(D-B) L 

9 20 (DNA sequence SEQ ID NO:2678; amino acid sequence SEQ ID NO:2644). 

HI 

5. Construction of TaqTth(Bs-B) L 

The Taq DN RX HT construct was cut with the enzymes BstBI and BamHI and 
the larger vector fragment was isolated as detailed above. The Tth DN RX HT GL 
25 construct was also cut with BstBI and BamHI and the smaller (approximately 633 base 
pairs) Tth fragment was gel isolated and purified. The Tth GL Ndel-BamHI insert was 
Ugated into the Taq Ndel-BamHI vector as detailed above to generate TaqTth(Bs-B) L 
(DNA sequence SEQ ID NO:2679; amino acid sequence SEQ ID NO:2645). 

30 6. Construction of TaqTth(N-Bs) G 
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The Taq DN RX HT construct was cut with the enzymes NotI and BstBI and the 
larger vector fragment was isolated as detailed above. The Tth DN RX HT GL construct 
was also cut with NotI and BstBI and the smaller (approximately 162 base pairs) Tth 
fragment was gel isolated and purified. The Tth GL Notl-BstBI insert was Ugated into 
5 the Taq Notl-BstBI vector as detailed above to generate TaqTth(N-Bs) G (DNA sequence 
SEQ ID NO:2680; amino acid sequence SEQ ID NO:2646). 

7. Construction of TthTaq(B-S) GL 

The Tth DN RX HT GL construct was cut with the enzymes BamHI and Sail and 
10 the largCT vector fragment was isolated as detailed above. The Taq DN RX HT construct 
was also cut with BamHI and Sail and the smaller (approximately 741 base pairs) Taq 
fragment was gel isolated and purified. The Taq BamHI-Sall insert was ligated into the 
Tth GL BamHI-Sall vector as detailed above to generate the TthTaq(B-S) GL (DNA 
sequence SEQ ID NO:2681; amino acid sequence SEQ ID NO:2647). 

15 

8. Construction of TaqTth(D-B) L E404H (DNA sequence SEQ ID 
NO:2684; amino acid sequence SEQ ID NO:2650) 

Site specific mutagenesis was performed on pTrc99A TaqTth(D-B) L DNA using 
the mutagenic primer 240-60-01 5'-gag gag gcg ggg cac egg gcc gcc ctt-3' (SEQ ID 
II 20 NO:277) to introduce the E404H mutation. 

9. Construction of TaqTth(D-B) L F413H/A414R (DNA sequence SEQ 
ID NO:2685; amino acid sequence SEQ ID NO:2651) 

Site specific mutagenesis was performed on pTrc99A TaqTth(D-B) L DNA using 
25 the mutagenic primer 240-60-02 5 '-ctt tec gag agg etc cat egg aac ctg tgg ggg agg-3 ' 
(SEQ ID NO:278) to introduce the F413H and the A414R mutations. 

10. Construction of TaqTth(D-B) L W417L/G418K (DNA sequence SEQ 
ID NO:2686; amino acid sequence SEQ ID NO:2652) 



if 
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Site specific mutagenesis was performed on pTrc99A TaqTth(D-B) L DNA using 
the mutagenic primer 240-60-03 5'-ctc ttc gcc aac ctg ctt aag agg ctt gag ggg gag-3' 
(SEQ E) NO:279) to introduce the W417L and the G418K mutations. 

5 11. Construction of TaqTth(D-B) L A439R (DNA sequence SEQ ID 

NO:2687; amino acid sequence SEQ ID NO:2653) 
Site specific mutagenesis was performed on pTrc99A TaqTth(D-B) DNA using 
the mutagenic primer 240-60-04 5'-agg ccc ctt tec egg gtc ctg gcc cat-3' (SEQ JD 
NO:280) to introduce the A439R mutation. 

10 

12. Construction of TaqTth(N-D) G L451R (DNA sequence SEQ ID 

P NO:2688; amino acid sequence SEQ ID NO:2654) 

f-i 

5 Site specific mutagenesis was performed on pTrc99A TaqTth(N-D) G DNA using 

;U the mutagenic primer 240-60-05 5'-acg ggg gtg cgc egg gac gtg gcc tat-3' (SEQ ED 

W 15 NO:281) to introduce the L415 mutation. 

n 

O 13. Construction of TaqTth(N-D) G R457Q (DNA sequence SEQ ID 

i| NO:2689; amino acid sequence SEQ ID NO:2655) 

Jl" Site specific mutagenesis was performed on pTrc99A TaqTth(N-D) G DNA usmg 

20 the mutagenic primer 240-60-06 5 '-gtg gcc tat etc cag gcc ttg tec ctg-3' (SEQ ID 
NO:282) to inlxoduce the L415Q mutation. 

14. Construction of TaqTth(N-D) G V463L (DNA sequence SEQ ID 
NO:2690; amino acid sequence SEQ ID NO:2656) 

25 Site specific mutagenesis was performed on pTrc99A TaqTth(N-D) G DNA using 

the mutagenic primer 240-60-07 5 '-ttg tec ctg gag ctt gcc gag gag atc-3' (SEQ ID 
NO:283) to introduce the V463L mutation. 

15. Construction of TaqTth(N-D) G A468R (DNA sequence SEQ ID 
30 NO:2691; amino acid sequence SEQ ID NO:2657) 
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Site specific mutagenesis was perfomied on pTrc99A TaqTth(N-D) G DNA using 
the mutagenic primer 240-60-08 5'-gcc gag gag ate cgc cgc etc gag gcc-3' (SEQ ID 
NO:284) to introduce the A468R mutation. 

16. Construction of TaqTth(N-D) G A472E (DNA sequence SEQ ID 
NO:2692; amino acid sequence SEQ ID NO:2658) 

Site specific mutagenesis was performed on pTrc99A TaqTth(N-D) G DNA using 
the mutagenic primer 240-60-09 5'-gcc cgc etc gag gag gag gtc ttc cgc-3' (SEQ ID 
NO:285) to introduce the A472E mutation, 

17. Construction of TaqTth(N-D) G G499R (DNA sequence SEQ ID 
NO:2693; amino acid sequence SEQ ID NO:2659) 

Site specific mutagenesis was performed on pTrc99A TaqTth(N-D) G DNA using 
the mutagenic primer 240-60-10 5'-ttt gac gag eta agg ctt ccc gcc atc-3' (SEQ ID 
NO:286) to introduce the G499R mutation. 

18. Construction of TaqTth(N-D) G E507Q (DNA sequence SEQ ID 
NO:2694; amino acid sequence SEQ ID NO:2660) 

Site specific mutagenesis was performed on pTrc99A TaqTth(N-D) G DNA using 
the mutagenic primer 276-046-04 5 '-ate gcc aag acg caa aag acc ggc aag-3' (SEQ ID 
NO:287) to introduce the E507Q mutation. 

19. Construction of TaqTtli(N-D) G Y535H (DNA sequence SEQ ID 
NO:2695; amino acid sequence SEQ ID NO:2661) 

Site specific mutagenesis was performed on pTrc99A TaqTth(N-D) G DNA using 
the mutagenic primer 240-60-1 1 5 '-aag ate ctg cag cac egg gag etc acc-3' (SEQ ID 
NO:288) to introduce the Y535H mutation. 

20. Construction of TaqTth(N-D) G S543N (DNA sequence SEQ ID 
NO:2696; amino acid sequence SEQ ID NO:2662) 
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Site specific mutagenesis was performed on pTrc99A TaqTth(N-D) G DNA using 
the mutagenic primer 240-60-12 5'-acc aag ctg aag aac acc tac att gac-3' (SEQ ID 
NO:289) to introduce the S543N mutation. 

21. Construction of TaqTth(N-D) G I546V (DNA sequence SEQ ID 
NO:2697; amino acid sequence SEQ ID NO:2663) 

Site specific mutagenesis was performed on pTrc99A TaqTth(N-D) G DNA using 
the mutagenic primer 240-60-13 5'-aag age acc tac gtg gac ccc ttg ccg-3' (SEQ ID 
NO:290) to introduce the I546V mutation. 

22. Construction of TaqTth(N-D) G D551S/I553V (DNA sequence SEQ 
ID NO:2698; amino acid sequence SEQ ID NO:2664) 

Site specific mutagenesis was performed on pTrc99A TaqTth(N-D) G DNA using 
the mutagenic primer 240-60-14 5 '-att gac ccc ttg ccg age etc gtc cac ccc agg acg ggc-3' 
(SEQ ID NO:291) to introduce the D551S and the I553V mutations. 

23. Construction of Tth DN RX HT GL H641 A, Tth DN RX HT GL 
H748A, Tth DN RX HT GL H786A 

Site specific mutagenesis was performed on pTrc99A Tth DN RX HT GL DNA 
using the mutagenic primer 583-001-02: 5'-gct tgc ggt ctg ggt ggc gat gtc ctt ccc ctc-3' 
(SEQ ID NO:294) to introduce the H641 A mutation (DNA sequence SEQ ID NO:2699; 
amino acid sequence SEQ ID NO:2665), or the mutagenic primer 583-001-03: 5' cat gtt 
gaa ggc cat ggc etc cgc ggc etc cct-3' (SEQ ID NO:295) to generate the H748A mutant 
(DNA sequence SEQ ID NO:2700; amino acid sequence SEQ ID NO:2666), or the 
mutagenic primer 583-001-04: 5'-cag gag gag etc gtt ggc gac ctg gag gag-3' (SEQ ID 
NO:296) to generate the H786A mutant enzyme (DNA sequence SEQ ID NO:2701; 
amino acid sequence SEQ ID NO:2667). 

24. Construction of Tth DN RX HT GL (H786A/G506K/Q509K) 

Starting with the mutant Tth DN RX HT GL H786A, generated above, site 
specific mutagenesis was done using the mutagenic primer 604-022-02: 5'-gga gcg ctt 
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gcc tgt ctt ctt cgt ctt ctt caa ggc ggg agg cct-3' (SEQ ID NO:297) to generate this variant 
termed "TthAKK GL", (DNA sequence SEQ ID NO:2702; amino acid sequence SEQ ID 
NO:2668). 

25, TthDN RX HT H786A random mutagenesis 

To generate mutants in the heUx-hairpin-hehx region of the TthDN RX HT 
H786A GL (SEQ ID NO:2667) enzyme, two different PGR reactions were performed 
using the H786A (SEQ ID NO:2701) mutant as a template. Starting with TthDN GL 
H786A discussed above, and using primer 604-08-06: 5'-gtc gga ggg gtc ccc cac gag-3' 
(SEQ ID NO:313) and primer 390-76-08: 5'-tgt gga att gtg age gg (SEQ ID NO:314), a 
620 base pair PGR fragment was generated. PGR reactions were performed using the 
Advantage cDNA PGR kit (Glontech) according to manufacturer's instructions. This 
PGR product includes amino acids 1-194. No mutations were introduced via this 
reaction, however the restriction enzyme site EcoRI is present at the 5' end. 

Starting with TthDN RX HT GL H786A discussed above, and using mutagenic 
primer 604-08-05: 5'-ctc gtg ggg gac ccc tec gac aac etc (ccc ggg gtc aag ggc ate ggg 
gag aag acc gcc) etc aag ctt etc aag-3' (SEQ ID NO:315) and primer 209-74-02: 5'-gtg 
gcc tec ata tgg gcc agg ac-3' (SEQ ID NO :3 16) a 787 base pair PGR fragment was 
generated. PGR reactions were done as above. This fragment does contain random 
mutations, due to the presence of the mutagenic primer, 604-08-05. The bases within the 
parenthesis of this primer were synthesized such that 91% of the sequence is wild-type, 
while the additional 9% is evenly divided between the remaining 3 bases. 

The two PGR fragments overlap, and were combined in a recombinant PGR 
reaction. Primers 390-76-08 and 209-74-02 were added, and the Advantage cDNA PGR 
kit (Glontech) was again used according to manufacturer's instructions. A 1380 base pair 
product was generated from this reaction. 

The recombinant PGR product was cut with the restriction enzymes EcoRI and 
NotI according to the manufacturer's instructions to yield a 986 base pair fragment. 
TthDN RX HT GL H786A was prepared by cutting with the same enzymes. The 
fragment was then ligated into the vector, and transformed into JM109 cells. New 
mutants developed from this set of reactions include: 
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TthDN RX HT GL H786A/P197R/K200R (DNA sequence SEQ ID NO:2703; amino 
acid sequence SEQ ID NO:2669). 

TthDN RX HT GL H786A/K205Y (DNA sequence SEQ ID NO:2704; amino acid 
sequence SEQ ID NO:2670). 

TthDN RX HT GL H786A/G203R (DNA sequence SEQ ID NO:2705; amino acid 
sequence SEQ ID NO:2671). 

26. Constnictioii of TaqTthAKK GL (DNA sequence SEQ ID NO:2706; 
amino acid sequence SEQ ID NO:2672) 

Chimeric mutant TaqTthAKK GL was generated by cutting Tth DN RX HT GL 
(H786AyG506K/Q509K) (SEQ ID NO:2702; here abbreviated TthAKK GL) and Taq 4M 
G504 (SEQ ID NO: 108; here abbreviated Taq 5M) with the restriction endonucleases 
EcoRI and Notl. The smaller insert fragment of Taq 5M and the larger vector fragment 
of Tth AKK GL were gel purified as detailed in Example 3D, and the insert fragments 
were ligated as described in Example 3D. Screening and verification of the construct 
sequence was also done as in Example 3D. 

27. Construction of TfiTth AKK GL (DNA sequence SEQ ID NO:2707; 
amino acid sequence SEQ ID NO:2673), TscTthAKK GL (DNA sequence SEQ ID 
NO:2708; amino acid sequence SEQ ID NO:2674) 

To generate chimeric enzymes between TthAKK GL (SEQ ID NO:2702), and Tfi 
DN 2M (SEQ ID NO: 130), or Tsc DN 2M (SEQ ID NO: 13 1), Tth DN RX HT GL 
(H786A/G506K/Q509K) DNA (SEQ ID NO:2702) was cut with EcoRI and Notl, and the 
larger vector fragment was gel isolated as above. The 993 base pair Eco RI-NotI 
fragements of Tfi and Tsc were prepared as described in Example 8. Ligations were 
performed as detailed in Example 3D, as was the screening and verification of the new 
constructs. 

28. Construction of TthAKK GL (P195A) and TthAKK GL (P195K) 

To introduce mutations at amino acid position 195 (either a P195A or P195K) in 
the nuclease domain of TthAKK GLconstruct, mutagenic primer 785-073-01 (P195A) 5'- 
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ccc-tcc-gac-aac-ctc-gcc-ggg-gtc-aag-ggc-atc-3' (SEQ ID NO: 370) or 785-073-02 
(P195K) 5'-ccc4cc-gac-aac-ctc-aag-ggg-gtc-aag-ggc-atc-3' (SEQ ID NO: 371) and 
primer 209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) were used in 
a PGR reaction to generate a 787 base pair fragment. Another PGR fragment was 
5 obtained by using the primers: 390-076-08 5 '-tgt-gga-att-gtg-agc-gg-3 ' (SEQ ID 

NO:314) and 785-073-03 5'-gag-gtt-gtc-gga-ggg-gtc-3' (SEQ ID NO: 372) in areaction 
with the same template. 

The two PGR fragments overlap and were combined in a recombinant PGR 
reaction. The outside primers 390-076-08 and 209-074-02 were added, and the 
10 Advantage cDNA PGR kit (Glontech) was used according to manufacturer's instructions. 
A 1380 base pair product was generated from this reaction. 

O The recombinant PGR product was cut with the restriction enzymes EcoRI and 

O 

5l NotI to yield a 986 base pair fragment. The TthAKK GLconstruct was prepared by 

J cutting with the same enzymes. The jfragment was then hgated into the vector, and 

Ml 

W 15 transformed into JMl 09 cells. New mutants developed from this set of reactions include: 
f TthAKK GL(P195 A) (DNA sequence SEQ ID NO: 2721 ; amino acid sequence SEQ ID 
g NO: 2722) and TthAKK GL (P195K) (DNA sequence SEQ ID NO: 723; amino acid 
ri sequence SEQ ID NO:2724). 

m 

ri 20 29. Construction of TthAKK GL(N417K/L418K) 

The same approach was used to construct TthAKK GL (N417K/L418K). Two 
overlapping PGR fragments were generated by mutagenic primers: 785-73-07 5'-gag- 
agg-ctc-cat-cgg-aag-aag-ctt-aag-cgc-ctc-gag-3' (SEQ ID NO: 377) and 700-10-03 5'-ctt- 
ctc-tca-tcc-gcc-aaa-aca-gcc-3' (SEQ ID NO:343), and primers 158-084-01 5'-ctc-ctc-cac- 
25 gag-ttc-ggc-3' (SEQ ID NO:310) and 785-73-08 5'-ccg-atg-gag-cct-ctc-cga-3' (SEQ ID 
NO: 378). The two products were combined and amplified with outside primers 700-10- 
03 and 158-084-01. The recombinant PGR product was cut with the restriction enzymes 
NotI and BarnHI and ligated into the Notl/BamHI pre-cut TthAKK GL construct. This 
mutant was termed TthAKK GL (N417K/L418K) (DNA sequence SEQ ID NO: 2725; 
30 amino acid sequence SEQ ID NO: 2726). 



30. Construction of TthAKK GL (P255L) 

Site specific mutagenesis was performed on the TthAKK GL constract using the 
mutagenic primer 886-049-05 and 886-049-06: 5'-gtg-cgc-acc-gac-ctc-ctc-ctg-gag-gtg- 
gac-ctc-3' (SEQIDNO: 381), 5'-gag-gtc-cac-ctc-cag-gag-gag-gtc-ggt-gcg-cac-3' (SEQ 
ID NO: 382) to generate TthAKK GL (P255L) (DNA sequence SEQ ID NO: 2727; 
amino acid sequence SEQ ID NO: 2728). 

31. Construction of TthAKK GL (F311Y) 

Site specific mutagenesis was performed on the TthAKK GL construct using the 
mutagenic primer 886-049-09 and 886-049-10: 5'-ggg-gcc-ttc-gtg-ggc-tac-gtc-ctc-tcc- 

cgc-ccc-3' (SEQ ID NO: P385), 5'"-ggg-gcg-gga-gag-gac-gta-gcc-cac-gaa-ggc-ccc-3' 
(SEQ ID NO: 386) to generate TthAKK GL (F311Y) (DNA sequence SEQ ID NO:2729; 
amino acid sequence SEQ ID NO:2730). 

32. Construction of TthAKK GL (N221H/R224Q) 

Site specific mutagenesis was performed on the TthAKK GL construct using the 
mutagenic primer 886-049-01 and 886-049-02: 5'-gaa-aac-ctc-ctc-aag-cac-ctg-gac-cag- 

gta-aag-cca-gaa-aac-3' (SEQ ID NO: 389), 5'-gtt-ttc-tgg-ctt-tac-ctg-gtc-cag-gtg-ctt-gag- 
gag-gtt-ttc-3' (SEQ ID NO: 390) to generate TthAKK GL (N221H/R224Q) (DNA 
sequence SEQ ID NO: 2731; amino acid sequence SEQ ID NO: 2732). 

33. Construction of TthAKK GL (R251H) 

Site specific mutagenesis was performed on TthAKK GL constract using the 
mutagenic primer 886-049-03 and 886-049-04: 5'-gag-ctc-tcc-cgg-gtg-cac-acc-gac-ctc- 
ccc-ctg-3' (SEQIDNO: 393), 5'-cag-ggg-gag-gtc-ggt-gtg-cac-ccg-gga-gag-ctc-3' (SEQ 
ID NO: 394) to generate TthAKK GL (R251H) (DNA sequence SEQ ID NO:2733; 
amino acid sequence SEQ ID NO:2734). 

34. Construction of TthAKK GL (P255L/R251H) 

Site specific mutagenesis was performed on the TthAKK GL (P255L) construct 
using the mutagenic primer 886-088-01 and 886-088-02: 5'-gag-ctc-tcc-cgg-gtg-cac-acc- 
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gac-ctc-ctc-ctg-3' (SEQIDNO: 397), 5'-cag-gag-gag-gtc-ggt-gtg-cac-ccg-gga-gag-ctc-3' 
(SEQ ID NO: 398) to generate TthAKK GL (P255L/R251H) (DNA sequence SEQ ID 
NO:2735; amino acid sequence SEQ ID NO:2736). 



5 35. Construction of Tth AKK GL L429V (DNA sequence SEQ ID 

NO:2737; amino acid sequence SEQ ID NO:2738) 

Palm region randomization mutagenesis was performed on the Tth AKK GL 
construct using the mutagenic primer 680-79-02 5 '-etc cat egg aac etc ctt [aag cgc etc gag 
ggg gag gag aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 
10 680-79-04 5'-AAG GAG GTT CCG ATG GAG-3' (SEQ ID NO: 404) to introduce the 
y random mutations in the Palm region. Brackets indicate a synthesis of 9 1 % base shown 

O and 3% all other bases. 

m 

W 36. Construction of Tth AKK GL E425V (DNA sequence SEQ ID 

15 NO:2739; amino acid sequence SEQ ID NO:2740) 

Palm region raadomization mutagenesis was perfomied on the Tth AKK GL 
fl construct using the mutagenic primer 680-79-02 5 '-etc cat egg aac etc ctt [aag cgc etc gag 
ii ggg gag gag aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 
y 680-79-04 5'-AAG GAG GTT CCG ATG GAG-3' (SEQ ID NO: 404) to introduce the 
20 random mutations in the Palm region. Brackets indicate a synthesis of 91% base shown 
and 3% all other bases. 

37. Construction of Ttli AKK GL L422N/E425K (DNA sequence SEQ ID 
NO:2741; amino acid sequence SEQ ID NO:2742) 

25 Palm region randomization mutagenesis was performed on the Tth AKK GL 

construct using the mutagenic primer 680-79-02 5 '-etc cat egg aac etc ctt [aag cgc etc gag 
ggg gag gag aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 
680-79-04 5'-AAG GAG GTT CCG ATG GAG-3' (SEQ ID NO: 404) to introduce the 
random mutations in the Palm region. Brackets indicate a synthesis of 91% base shown 

30 and 3% all other bases. 
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38. Construction of Tth AKK GL L422FAV430C (DNA sequence SEQ ID 
NO:2743; amino acid sequence SEQ ID NO:2744) 

Palm region randomization mutagenesis was performed on Tth AKK GL DNA 
using the mutagenic primer 680-79-02 5 '-etc cat egg aac etc ctt [aag cgc etc gag ggg gag 
gag aag etc ctt tgg] etc tac cac gag gtg-3' (SEQ ID NO: 403) and reverse primer 680-79- 
04 5'-AAG GAG GTT CCG ATG GAG-3' (SEQ JD NO: 404) to introduce the random 
mutations in the Palm region. Brackets indicate a synthesis of 91% base shown and 3% 
all other bases. 

39. Construction of Tth AKK GL A504F (DNA sequence SEQ ID 
NO:2745; amino acid sequence SEQ ID NO:2746) 

Site saturation mutagenesis was performed on Tth AKK GL construct using the 
mutagenic primer 680-80-03 5'-cag gag ctt agg ctt ccc nnn ttg aag aag acg aag aag aca-3' 
(SEQ ID NO: 413) and reverse primer 680-80-06 5'-cct aag etc gtc aaa gag-3' (SEQ ID 
NO: 414) to introduce the random mutations in the A504 amino acid. 

40. Construction of Tth AKK GL A504V (DNA sequence SEQ ID 
NO:2747; amino acid sequence SEQ ID NO:2748) 

Site saturation mutagenesis was performed on Tth AKK GL construct using the 
mutagenic primer 680-80-03 5'-cag gag ctt agg ctt ccc nnn ttg aag aag acg aag aag aca-3' 
(SEQ ID NO: 413) and reverse primer 680-80-06 5'-cct aag etc gtc aaa gag-3' (SEQ ID 
NO: 414) to introduce the random mutations in the A504 amino acid. 

41. Construction of Tth AKK GL A504S (DNA sequence SEQ ID 
NO:2749; amino acid sequence SEQ ID NO: 2750) 

Site saturation mutagenesis was performed on Tth AKK GL construct using the 
mutagenic primer 680-80-03 5'-cag gag ctt agg ctt ccc nnn ttg aag aag acg aag aag aca-3' 
(SEQ ID NO: 413) and reverse primer 680-80-06 5'-cct aag etc gtc aaa gag-3' (SEQ ID 
NO: 414) to introduce the random mutations in the A504 amino acid. 
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42. Construction of Tth AKK GL S517G (DNA sequence SEQ ID 
NO:2751; amino acid sequence SEQ ID NO: 2752) 

Site saturation mutagenesis was performed on Tth AKK GL construct using the 
mutagenic primer 680-80-07 5'-GGC AAG CGC TCC ACC NNN GCC GCG GTG CTG 
5 GAG GCC CTA CGG-3' (SEQ ID NO: 421) and reverse primer 680-80-10 5'-GGT 
GGA GCG CTT GCC-3' (SEQ ID NO: 422) to introduce the random mutations in the 
S517 amino acid. 

43 Construction of Tth AKK GL A518L (DNA sequence SEQ ID 
^10 NO:2753; amino acid sequence SEQ ID NO:2754) 

Q Site saturation mutagenesis was performed on Tth AKK GL construct using the 

si mutagenic primer 680-80-07 5 '-GGC AAG CGC TCC ACC AGC NNN GCG GTG CTG 
£ GAG GCC CTA CGG-3' (SEQ ID NO: 425) and reverse primer 680-80-10 5'-GGT 
fi GGA GCG CTT GCC-3 ' (SEQ ID NO:422) to introduce the random mutations in the 
15 A518 amino acid. 

is 

PI 

m 44. Construction of m AKK GL A518R (DNA sequence SEQ ID 

S NO:2755; amino acid sequence SEQ ID NO:2756) 

ft Site saturation mutagenesis was performed on Tth AKK GL construct using the 

20 mutagenic primer 680-80-07 5 '-GGC AAG CGC TCC ACC AGC NNN GCG GTG CTG 
GAG GCC CTA CGG-3' (SEQ ID NO: 425) and reverse primer 680-80-10 5'-ggt gga 
gcg ctt gcc-3' (SEQ ID NO: 422) to introduce the random mutations in the A518 amino 
acid. 

25 45. Construction of Tth AKK GL A504K (DNA sequence SEQ ID 

NO:2757; amino acid sequence SEQ ID NO:2758) 

Site directed mutagenesis was performed on the Tth AKK GL construct using the 
mutagenic primer 680-69-04 5'-ctt agg ctt ccc aag ttg aag aag acg aag aag aca-3' (SEQ 
ID NO: 433) and reverse primer 680-69-05 5'-tgt ctt ctt cgt ctt ctt caa ctt ggg aag cct aag- 

30 3 ' (SEQ ID NO: 434) to introduce the A504K mutation in the Tth AKK GL enzyme. 
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46. Construction of Tth AKK GL H641 A (DNA sequence SEQ ID 
NO:2759; amino acid sequence SEQ ED NO:2760) 

Site directed mutagenesis was performed on Tth AKK GL construct using the 
mutagenic primer 680-69-08 5 '-gag ggg aag gac ate gcc acc cag acc gca agc-3' (SEQ ID 
NO: 437) and reverse primer 583-01-02 5'-gct tgc ggt ctg ggt ggc gat gtc ctt ccc etc- 3' 
(SEQ ID NO: 438) to introduce the H641 A mutation in the Tth AKK GL enzyme. 

47. Construction of Tth AKK GL T508P (DNA sequence SEQ ID 
NO:2761; amino acid sequence SEQ ID NO:2762) 

Site directed mutagenesis was performed on TthAKK GL construct using the 
mutagenic primer 680-70-01 5'-ccc-gcc ttg aag aag cog aag aag aca ggc aag-3' (SEQ ID 
NO: 441) and reverse primer 680-70-02 5'-ctt gcc tgt ctt ctt egg ctt ctt caa ggc ggg-3' 
(SEQ ID NO: 442) to introduce the T508P mutation in the Tth AKK GL enzyme. 

48. Construction of Taq(FT)TthAKK GL 

Site specific mutagenesis was performed on the TaqTthAKK GL construct using 
the mutagenic primer 473-087-05: 5'-cgg-gac-ctc-gag-gcg-cgt-gaa-ccc-cag-gag-gtc-cac- 
3' (SEQ ID NO: 483) to introduce the L107F/A108T mutations and generate 
Taq(FT)TthAKK GL (DNA sequence SEQ ID NO:2779; amino acid sequence SEQ ID 
NO:2780). 

49. Construction of Tfi(FT)TthAKK GL 

Site specific mutagenesis was performed on the TfiTthAKK GL construct using 
the mutagenic primers 785-096-01: 5'-gtg-gac-ctt-ctg-ggc-ttt-acc-cgc-ctc-gag-gcc-ccg-3' 
(SEQ ID NO: 486) and 785-096-02: 5'-cgg-ggc-ctc-gag-gcg-ggt-aaa-gcc-cag-aag-gtc- 
cac-3' (SEQ ID NO: 487) to introduce the L107FA^108T mutations and generate 
Tfi(FT)TthAKK GL (DNA sequence SEQ ED NO:2720; amino acid sequence SEQ ID 
NO:2781). 

50. Construction of Tsc(FT)TthAKK GL 

Site specific mutagenesis was performed on the TscTthAKK GL construct using 
the mutagenic primers 785-008-03 5'-ttt-acc-cgc-ctc-gag-gtg-ccg-ggc-3' (SEQ ID NO: 
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489) and reverse primer 680-21-03 5'-cgg cac etc gag gcg ggt aaa gee caa aag gtc cac-3' 
(SEQ ID NO: 490) to introduce the L107F/E108T mutations and generate 
Tsc(FT)TthAKK GL (DNA sequence SEQ ID NO:2764; amino acid sequence SEQ ID 
NO:2782). 

5 

51. Fusion between TthAKK GL enzyme and alpha-peptide 

A TthAKK GL-lacZ-alpha-peptide chimeric fusion was constructed to allow 
detection of mutations (including frame-shifts, deletions, insertions, etc.) which cause the 
inability of expression of the full-length fusion protein based on the colony blue-white 
10 screening (Wu et aL, Nucleic Acids Research, 24:1710 [1996]). 
H Site specific mutagenesis was performed on TthAKK GL DNA using the 

jSj mutagenic primers 959-041-03, 5'-cac-cac-cac-cac-cac-cac-gtc-gac-tag-tgc-tag-cgt-cga- 
f cta-gct-gca-ggc-atg-caa-gct-tgg-c-3' (SEQ ID NO: 477) and 959-041-04, 5 '-gcc-aag-ctt- 
B8 gca-tgc-ctg-cag-cta-gtc-gac-gct-agc-aet-agt-cga-cgt-ggt-ggt-ggt-ggt-ggt-g-3' (SEQ ID 
J 15 NO: 478) to generate pTthAKK GL-L with Sail site following the 6xHis tag at the C- 

terminus of TthAKK GL for the insertion of lacZ alpha peptide. The alpha peptide (of 
If 201 amino acids) was first PGR amplified firom the pCRH-TOPO vector (Invitrogen) with 
p| primers 959-041-01, 5'-cag-gaa-gcg-gcc-gcg-tcg-aca-tga-cca-tga-tta-cgc-caa-gc-3' (SEQ 
2 ID NO: 479) and 959-093-01, 5'-ggg-ccc-gcc-agg-gtc-gac-tca-ggg-cga-tgg-ccc-act-acg- 
' 20 tga-3' (SEQ ID NO: 480). The PGR product was then digested with restriction enzyme 
Sail and ligated into the pTthAKK GL-L vector, which was also cut with the same 
enzyme to generate the chimeric construct TthAKK GL-alpha peptide (DNA sequence 
SEQ ID NO:2777; amino acid sequence SEQ ID NO:2778). The orientation of the insert 
was confirmed by sequencing. 

25 

52. Construction of TaqEFT-Tth(AKK) GL (DNA sequence SEQ ID 
NO:2790; amino acid sequence SEQ ID NO:2791) 

Site specific mutagenesis was performed on Taq(FT)-Tth(AKK) GL DNA (SEQ 
ID NO: 2779) using the mutagenic primers 436-013-08: 5'-atc-gtg-gtc-ttt-gac-gcc-gag- 
30 gcc-ccc-tcc-ttc-c-3 ' (SEQ ID NO:503) and 436-01 3-09 5'-gga-agg-agg-ggg-cct-cgg-cgt- 
caa-aga-cca-cga-t-3' (SEQ ID NO:504) to introduce the K70E mutation. 
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53. Construction of TaqEFT-Tth(AKK) GL-AMl (DNA sequence SEQ 
ID NO:2792; amino acid sequence SEQ ID NO:2793) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) GL DNA (SEQ 
ID NO:2790) using the mutagenic primers 1044-038-01: 5'-cag-acc-atg-aat-tcg-gag-gcg- 
atg-ctg-ccc-ctc-ttt-3' (SEQ ID NO:507) and 1044-038-02: (SEQ ID NO: 508) 
5'-aaa-gag-ggg-cag-cat-cgc-ctc-cga-att-cat-ggt-ctg-3' to introduce the G4EA mutation. 

54. Construction of TaqEFT-Tth(AKK) GL-B/M2 (DNA sequence SEQ 
ID NO:2794; amino acid sequence SEQ ID NO: 2795) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) GL DNA (SEQ 
ID NO: 2790) using the mutagenic primers 1044-038-03: 5'-gcc-tac-cgc-acc-ttc-ttt-gcc- 
ctg-aag-ggc-ctc-3 and 1044-038-04: 5'-gag-gcc-ctt-cag-ggc-aaa-gaa-ggt-gcg-gta-ggc-3' 
(SEQ ID NO: 511 and 512, respectively) to introduce the H29F mutation. 

55. Construction of TaqEFT-Ttli(AKK) GL-C/M3 (DNA sequence SEQ 
ID NO:2796; amino acid sequence SEQ ID NO:2797) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) GL DNA 
(SEQ ID NO: 2790) using the mutagenic primers 1044-038-05: 5'-ctc-ctc-aag-gcc-ctc- 
aga-gag-gac-ggg-gac-gcg-3' and 1044-038-06: 5'-cgc-gtc-ccc-gtc-ctc-tct-gag-ggc-ctt- 
gag-gag-3' (SEQ ID NO: 515 and 516, respectively) to introduce the IC57R mutation. 

56. Construction of TaqEFT-Tth(AKK) GL-D/M5 (DNA sequence SEQ 
ID NO:2798; amino acid sequence SEQ ID NO:2799) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) GL DNA (SEQ 
ID NO: 2790) using the mutagenic primers 1044-038-09: 5'-gac-gac-gtc-ctg-gcc-acc-ctg- 
gcc-aag-aag-gcg-3' and 1044-038-10: 5'-cgc-ctt-ctt-ggc-cag-ggt-ggc-cag-gac-gtc-gtc-3' 
(SEQ ID NO: 519 and 520, respectively) to introduce the S125T mutation. 
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57. Construction of TaqEFT-Tth(AKK) GL-E/M6 (DNA sequence SEQ 
ID NO:2800; amino acid sequence SEQ ID NO:2801) 

Site specific mutagenesis was performed on Taq(EFr)-Tth(AKK) GL DNA (SEQ 
ID NO: 2790) using the mutagenic primers 1044-038-11: 5'-ggg-gag-aag-acg-gcg-ctc- 
aag-ctt-ctg-gag-gag-3' and 1044-038-12: 5'-ctc-ctc-cag-aag-ctt-gag-cgc-cgt-ctt-ctc-ccc-3' 
(SEQ ID NO: 523 and 524, respectively) to introduce the R206L mutation. 

58. Construction of TaqEFT-Ttli(AKiq GL-F/M7 (DNA sequence SEQ 
ID NO:2802; amino acid sequence SEQ ID NO:2803) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) GL DNA (SEQ 
ID NO:2790) using the mutagenic primers 1044-038-13: 5'-gag-ccc-gac-cgg-gag-ggg-ctt- 
aag-gcc-ttt-ctg-gag-agg-3' and 1044-038-14: 5'-cct-ctc-cag-aaa-ggc-ctt-aag-ccc-ctc-ccg- 
gtc-ggg-ctc-3 (SEQ ID NO: 527 and 528, respectively) to introduce the R269G and 
R27 IK mutations. 

59. Construction of TaqEFT-Tth(AKK) GL-G/M8 (DNA sequence SEQ 
ID NO:2804; amino acid sequence SEQ ID NO:2805) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) GL DNA (SEQ 
ED NO: 2790) using the mutagenic primers 1044-038-15: 5'-cac-gag-ttc-ggc-ctt-ctg-gga- 
ggg-gag-aag-ccc-cgg-gag-gag-gcc-ccc-tgg-ccc-3' and 1044-038-16: 5'-ggg-cca-ggg-ggc- 
ctc-ctc-ccg-ggg-ctt-ctc-ccc-tcc-cag-aag-gcc-gaa-ctc-gtg-3' (SEQ ID NO: 531 and 532, 
respectively) to introduce the E290G, S291G, P292E, A294P, and L295R mutations. 

60. Construction of TaqEFT-Ttli(AKK) GL-H/M9 (DNA sequence SEQ 
ID NO:2806; amino acid sequence SEQ ID NO:2807) 

Site specific mutagenesis was performed on Taq(EFT)-Tth(AKK) GL DNA (SEQ 
ID NO: 2790) using the mutagenic primers 1044-038-17: 5'-ctg-gcc-ctg-gcc-gcc- tgc- 
agg-ggc-ggc-cgc-gtg-3' and 1044-038-18: 5'-cac-gcg-gcc-gcc-cct-gca-ggc-ggc-cag-ggc- 
cag-3' (SEQ ID NO: 535 and 536, respectively) to introduce the A328C mutation. 



262 



61. Construction of TaqEFT-Tth(AKK) GL-I/MIO (DNA sequence SEQ 
ID NO:2808; amino acid sequence SEQ ID NO:2809) 

Site specific mutagenesis was perfonned on Taq(EFT)-Tth(AKK) GL DNA (SEQ 
ID NO: 2790) using the mutagenic primers 1080-015-01 (SEQ ID NO:539) 5'-ggg gag 

5 aag acg gcg agg aag ctt ctg aag gag tgg ggg agc-3 ' and 1 080-01 5-02 (SEQ ID NO: 540) 
5 '-get ccc cca etc ctt cag aag ctt cct cgc cgt ctt etc ccc-3' to introduce the E210K 
mutation. 

62. Construction of TaqEFT-Tth(AKK) GL-Ml-9 (DNA sequence SEQ 
10 ID NO:2810; amino acid sequence SEQ ID NO:2811) 

Q Seven independent PGR reactions were performed, using construct TaqEFT- 

!S Tth(AKK) GL (SEQ ID: 2790) as a template, with the following pairs of mutagenic 
^ primers: PGR Reaction 1; 1044-038-01 5'-cag-acc-atg-aat-tcg-gag-gcg-atg-ctg-ccc-ctc- 
5 ttt-3' (SEQ ID NO:507) and 1044-038-04 5'-gag-gcc-ctt-cag-ggc-aaa-gaa-ggt-gcg-gta- 
^ 15 ggc-3' (SEQ ID NO:512) to yield a 108 base pau: fragment, PGR Reaction 2; 1044-038- 
O 03 5 '-gcc-tac-cgc-acc-ttc-ttt-gcc-ctg-aag-ggc-ctc-3' (SEQ ID NO: 5 11) and 1044-038-06 
ll 5'-cgc-gtc-ccc-gtc-ctc-tct-gag-ggc-ctt-gag-gag-3' (SEQ ID NO: 516) to yield a 1 17 base 
iJ pair fragment, PGR Reaction 3; 1044-038-05 5'-ctc-ctc-aag-gcc-ctc-aga-gag-gac-ggg- 
il gac-gcg-3' (SEQ ID NO: 5 1 5) and 1 044-038-1 0 5 '-cgc-ctt-ctt-ggc-cag-ggt-ggc-cag-gac- 
20 gtc-gtc-3' (SEQ ID NO: 520) to yield a 237 base pair fragment, PGR Reaction 4; 1 044- 
038-09 5'-gac-gac-gtc-ctg-gcc-acc-ctg-gcc-aag-aag-gcg-3' (SEQ ID NO: 519) and 1044- 
038-12 5'-ctc-ctc-cag-aag-ctt-gag-cgc-cgt-ctt-ctc-ccc-3' (SEQ ID NO: 524) to yield a 276 
base pair fragment, PGR Reaction 5; 1044-038-11 5'-ggg-gag-aag-acg-gcg-ctc-aag-ctt- 
ctg-gag-gag-3' (SEQ ID NO: 523) and 1044-038-14 5'-cct-ctc-cag-aaa-ggc-ctt-aag-ccc- 
25 ctc-ccg-gtc-ggg-ctc-3' (SEQ ID NO: 528) to yield a 228 base pair fragment, PGR 
Reaction 6; 1044-038-13 5'-gag-ccc-gac-cgg-gag-ggg-ctt-aag-gcc-ttt-ctg-gag-agg-3' 
(SEQ ID NO: 527) and 1044-038-16 5'-ggg-cca-ggg-ggc-ctc-ctc-ccg-ggg-ctt-ctc-ccc-tcc- 
cag-aag-gcc-gaa-ctc-gtg-3' (SEQ ID NO: 532) to yield a 1 13 base pair fragment, PGR 
Reaction 7; 1044-038-15 5'-cac-gag-ttc-ggc-ctt-ctg-gga-ggg-gag-aag-ccc-cgg-gag-gag- 
30 gcc-ccc-tgg-ccc-3' (SEQ ID NO: 531) and 1044-038-18 5'-cac-gcg-gcc-gcc-cct-gca-ggc- 
ggc-cag-ggc-cag-3' (SEQ ID NO: 536) to yield a 157 base pair fragment. The seven PGR 



263 



products overlap such that PGR ampUfication in the absence of primers yielded the 
appropriate 1005 base pair product to introduce the K70E, G4EA, H29F, K57R, S125T, 
R206L, R269G, R271K, E290G, S291G, P292E, A294P, L295R, and A328C mutations. 
The product was further amplified using the outside primers: 1044-038-1 (SEQ ID NO: 
5 507) and 1044-038-18 (SEQ ID NO: 536) and cloned into pPCR-Script-Amp. From this 
construct, the nuclease domain was again ampUfied using primers: 1044-038-1 (SEQ ID 
NO: 507) and 1044-038-18 (SEQ ID NO: 536) and digested with EcoRI and Notl. The 
digested PGR product was Ugated into an EcoRI and Notl digested TaqEFT-Tth(AKK) 
GL construct and transformed into JMl 09 for protein expression and screening. 

10 

63. Construction of TaqEFr-Tth(AKK) GL-Ml-1 0 (DNA sequence SEQ 
2 ID NO:2812; amino acid sequence SEQ ID NO:2813) 

m Seven independent PGR reactions were performed, using construct TaqEFT- 

J Tth(AKK) GL (SEQ ID: 2790) as a template, with the following pairs of mutagenic 
W 15 primers: PGR Reaction 1; 1044-038-01 5'-cag-acc-atg-aat-tcg-gag-gcg-atg-ctg-ccc-ctc- 
r ttt-3' (SEQ ID NO:507) and 1 044-038-04 5'-gag-gcc-ctt-cag-ggc-aaa-gaa-ggt-gcg-gta- 
S ggc-3' (SEQ ID NO:512) to yield a 108 base pair fragment, PGR Reaction 2; 1044-038- 

il 03 5'-gcc-tac-cgc-acc-ttc-ttt-gcc-ctg-aag-ggc-ctc-3" (SEQ ID NO: 51 1) and 1044-038-06 

HI 

Q 5'-cgc-gtc-ccc-gtc-ctc-tct-gag-ggc-ctt-gag-gag-3' (SEQ ID NO: 516) to yield a 1 17 base 
^ 20 pair fragment, PGR Reaction 3; 1044-038-05 5 '-ctc-ctc-aag-gcc-ctc-aga-gag-gac-ggg- 
gac-gcg-3' (SEQ ID NO: 515) and 1044-038-10 5'-cgc-ctt-ctt-ggc-cag-ggt-ggc-cag-gac- 
gtc-gtc-3' (SEQ ID NO: 520) to yield a 237 base pair fragment, PGR Reaction 4; 1044- 
038-09 5'-gac-gac-gtc-ctg-gcc-acc-ctg-gcc-aag-aag-gcg-3' (SEQ ID NO: 519) and 1080- 
42-02 5'-gc ttc cag get ccc cca etc ctt cag aag ctt gag cgc cgt ctt etc ccc-3' (SEQ ID NO: 
25 546) to yield a 299 base pair fragment, PGR Reaction 5; 1080-42-01 5'-ggg gag aag acg 
gcg etc agg ctt ctg aag gag tgg ggg age ctg gaa gc-3' (SEQ ID NO: 545) and 1044-038-14 
5'-cct-ctc-cag-aaa-ggc-ctt-aag-ccc-ctc-ccg-gtc-ggg-ctc-3' (SEQ ID NO: 528) to yield a 
228 base pair fragment, PGR Reaction 6; 1044-038-13 5'-gag-ccc-gac-cgg-gag-ggg-ctt- 
aag-gcc-ttt-ctg-gag-agg-3' (SEQ ID NO: 527) and 1044-038-16 5'-ggg-cca-ggg-ggc-ctc- 
30 ctc-ccg-ggg-ctt-ctc-ccc-tcc-cag-aag-gcc-gaa-ctc-gtg-3' (SEQ ID NO: 532) to yield a 113 
base pair fragment, PGR Reaction 7; 1044-038-15 5'-cac-gag-ttc-ggc-ctt-ctg-gga-ggg- 
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gag-aag-ccc-cgg-gag-gag-gcc-ccc-tgg-ccc-3' (SEQ IDNO: 531) and 1044-038-18 5'-cac- 
gcg-gcc-gcc-cct-gca-ggc-ggc-cag-ggc-cag-3' (SEQ ID NO: 536) to yield a 157 base pair 
fragment. The seven PGR products overlap such that PGR amplification in the absence 
of primers yielded the appropriate 1005 base pair product to introduce the K70E, G4EA, 
H29F, K57R, S125T, R206L, E210K, R269G, R271K, E290G, S291G, P292E, A294P, 
L295R, and A328C mutations. The product was further amplified using the outside 
primers: 1044-038-1 (SEQ ID NO: 507) and 1044-038-18 (SEQ ID NO: 536) and cloned 
into pPCR-Script-Amp. From this construct, the nuclease domain was again ampUfied 
using primers: 1044-038-1 (SEQ ID NO: 507) and 1044-038-18 (SEQ ID NO: 536) and 
digested with EcoRI and Not! The digested PGR product was ligated into an EcoRI and 
NotI digested TaqEFT-Tth(AKK) GL construct and transformed into JM109 for enzyme 
expression and screening. 

64. Construction of Tth(K69E)AKK, Taq(K69E)TthAKK, 
Tfi(K69E)TthAKK and Tsc(K69E)TthAKK and mutants 

Site specific mutagenesis was performed on TthAKK GL, TaqTthAKK GL, 
TfiTthAKK GL and TscTthAKK GL DNAs using the mutagenic primers 5'-atc-gtg-gtc- 
ttt-gac-gcc-gag-gcc-ccc-tcc-ttc-c-3' (SEQ ID NO: 492) and 5'-gga-agg-agg-ggg-cct-cgg- 
cgt-caa-aga-cca-cga-t-3' (SEQ ID NO: 493) to introduce the K69E mutation and to 
generate Tth(K69E)AKK GL (DNA sequence SEQ ID NO:2763; amino acid sequence 
SEQ ID NO:2783), Taq(K69E)ThAKK GL, (DNA sequence SEQ ID NO:2784; amino 
acid sequence SEQ ID NO:2785), Tfi(K69E)TthAKK GL (DNA sequence SEQ ID 
NO:2786; amino acid sequence SEQ ID NO:2787)and Tsc(K69E)TthAKK: GL (DNA 
sequence SEQ ID NO:2788; amino acid sequence SEQ ID NO:2789) mutant enzymes. 

65. Construction of Tsc(167-334)TthAKK GL 

Two overlapping PGR fragments were generated by primers 390-76-08: 5'-tgt- 
gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-041-01: 5'-ctt-ctc-tca-tcc-gcc-aaa-aca- 
gcc-3' (SEQ ID NO: 451) with template Tth(K69E)AKK GL, (DNA sequence SEQ ID 
NO:2763), and primers 1044-041-02: 5'-ctc-ctc-cac-gag-ttc-ggc-3' (SEQ IDNO: 453) 
and 209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) with template 
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Tsc(K69E)TthAKK GL, (DNA sequence SEQ ID NO:2764). Two products were 
combined and amplified with outside primers 390-76-08 and 209-074-02. The 
recombinant PGR product was cut with the restriction enzymes EcoRI and NotI and 
ligated into the vector TthAKK GL which was prepared by cutting with the same 
5 enzymes to yield Tsc(l 67-333)TthAKK GL construct (DNA sequence SEQ ID NO:2765; 
amino acid sequence SEQ ID NO:2766). 

66. Construction of Tsc(222-334)TthAKK GL 

Two overlapping PGR fragments were generated by primers 390-76-08 : 5 ' -tgt- 
10 gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-041-03: 5'-ttc-cag-gtg-ctt-gag-gag-gtt- 
U ttc-cag-3' (SEQ ID NO: 457) with template Tth(K69E)AKK GL (DNA sequence SEQ ID 
2 NO:2763), and primers 1044-041-04: 5'-ctc-ctc-aag-cac-ctg-gaa-cag-gtg-aaa-3' (SEQ ID 
fi> NO: 458) and 209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) with 
S template Tsc(K69E)TthAKK GL, (DNA sequence SEQ ID NO:2764). Two products 
^'J 1 5 were combined and amplified with outside primers 390-76-08 and 209-074-02. The 

« recombinant PGR product was cut with the restriction enzymes EcoRI and NotI and 

n 

f| ligated into the vector TthAKK GL which was prepared by cutting with the same 
2 enzymes to yield Tsc(222-334)TthAKK GL construct (DNA sequence SEQ ID NO:2767; 

0 amino acid sequence SEQ ID NO:2768). 
20 

67. Construction of Tfi(222-334)TthAKK GL 

Two overlapping PGR firagments were generated by primers 390-76-08: 5'-tgt- 
gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-058-05: 5'-gtc-cag-gtt-ctt-gag-gag-gtt- 
ttc-cag-3' (SEQ ID NO: 461) with template Tth(K69E)AKK GL (DNA sequence SEQ ID 

25 NO:2763), and primers 1044-058-06: 5'-ctc-ctc-aag-aac-ctg-gac-cgg-gta-aag-3' (SEQ ID 
NO: 462) and 209-074-02: 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID NO:316) with 
template Tfi(K69E)TthAKK GL (DNA sequence SEQ ID NO:2786). Two products were 
combined and amphfied with outside primers 390-76-08 and 209-074-02. The 
recombinant PGR product was cut with the restriction enzymes EcoRI and NotI and 

30 ligated into the vector TthAKK GL which was prepared by cutting with the same 
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enzymes to yield Tfi(222-334)TthAKK GL construct (DNA sequence SEQ ID NO:2769; 
amino acid sequence SEQ ID NO:2770). 

68. Construction of Tfi(167-334)TthAKK GL 

5 Two overlapping PGR fragments were generated by primers 390-76-08 5'-tgt- 

gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-058-07 5'-gac-gtc-ctt-cgg-ggt-gat- 
gag-gtg-gcc-3' (SEQ ID NO: 465) with template Tth(K69E)AKK GL, (DNA sequence 
SEQ IDNO:2763), and primers 1044-058-08 5'-atc-acc-ccg-aag-gac-gtc-cag-gag-aag-3' 
(SEQ ID NO: 466) and 209-074-02 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID 
10 NO:316) with template Tfi(K69E)TthAKK GL, (DNA sequence SEQ ID NO:2787). 
5 , Two products were combined and amplified with outside primers 390-76-08 and 209- 
0 074-02. The recombinant PGR product was cut with the restriction enzymes EcoRI and 
|i NotI and Ugated into the vector TthAKK GL which was prepared by cxrtting with the 
;f same enzymes to yield Tfi(167-333)TthAKK GL construct (DNA sequence SEQ JD NO: 
W 1 5 277 1 ; amino acid sequence SEQ ID NO:2772). 

t: 

9 69. Constructioii of Tsc(l ll-334)TtIiAKK GL 

m 

ri Two overlapping PCR fragments were generated by primers 390-76-08 5'-tgt- 

Q gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-058-01 5'-ctc-gag-gcg-ggt-aaa-ccc- 
^ 20 cag-gag-gtc-3' (SEQ ID NO: 469) with template Tth(K69E)AKK GL (DNA sequence 
SEQ ID NO:2763), and primers 1044-058-02 5'-ggg-ttt-acc-cgc-ctc-gag-gtg-ccc-ggc-3' 
(SEQ ID NO: 470) and 209-074-02 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID 
NO:316) with template Tsc(K69E)TthAKK GL (DNA sequence SEQ ID NO:2788). 
Two products were combined and amplified with outside primers 390-76-08 and 209- 
25 074-02. The recombinant PCR product was cut with the restriction enzymes EcoRI and 
NotI and Ugated into the vector TthAKK GL which was prepared by cutting with the 
same enzymes to yield Tsc(167-333)TthAKK GL construct (DNA sequence SEQ ID 
NO:2773; amino acid sequence SEQ ID NO:2774). 
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70. Construction of Tsc(l-167)TthAKK GL 

Two overlapping PCR fragments were generated by primers 390-76-08 5'-tgt- 
gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 1044-058-03 5'-aag-cca-ctc-cgg-ggt-gat- 
cag-gta-acc-3' (SEQ ID NO: 473) with template Tsc(K69E)TthAKK GL (DNA sequence 

5 SEQ ID NO:2788), and primers 1044-058-04 5'-atc-acc-ccg-gag-tgg-ctt-tgg-gag-aag-3' 
(SEQ ID NO: 474) and 209-074-02 5'-gtg-gcc-tcc-ata-tgg-gcc-agg-ac-3' (SEQ ID 
NO:316) with template Tth(K69E)AKK GL (DNA sequence SEQ ID NO:2763). Two 
products were combined and amplified with outside primers 390-76-08 and 209-074-02. 
The recombinant PCR product was cut with the restriction enzymes EcoRI and NotI and 

1 0 Ugated into the vector TthAKK GL which was prepared by cutting with the same 

enzymes to yield Tsc(l -1 67)TthAKK GL construct (DNA sequence SEQ ID NO:2775; 

CI amino acid sequence SEQ ID NO:2776). 

w 

f 71. Construction of Afii336-Tth296(AKK) GL, ATl-1 

y 1 5 Two overlapping PCR fragments were generated. The first fragment was made 

'f^ using the AfuFEN construct (SEQ ID NO:556) as template with the primers: 390-076-08 
0 5'-tgt-gga-att-gtg-agc-gg-3 ' (SEQ ID N0:3 14) and 390-065-05 5'-gaa-cca-cct-ctc-aag- 
fj cgt-gg-3' (SEQ ID NO: 554). The second fragment was made using TthAKK GL (SEQ 
15 ID NO:2702) as template and the primers: 700-049-0 1 5 '-acg-ctt-gag-agg-tgg-ttc-ctg- 

11 20 gag-gag-gcc-ccc-tgg-3' (SEQ ID NO: 555) and 390-076-09 5 '-taa-tct-gta-tca-ggc-tg-3 ' 
(SEQ ID NO:557). The two products contain a region of sequence overlap, and were 
combined and amplified with outside primers 390-076-08 and 390-076-09 in a 
recombinant PCR reaction. The recombinant PCR product was cut wifli the restriction 
enzymes BspEI and Sail and ligated into the AiuFEN construct which was prepared by 
25 cutting with the same enzymes to yield Afu336-Tth296(AKK) GL construct (DNA 
sequence SEQ ID NO:2814; amino acid sequence SEQ ID NO:2815). 

72. Construction of Afu328-Tth296(AKK) GL, AT2-3 

Two overlapping PCR fragments were generated, tiie first by primers: 390-076-08 
30 5'-tgt-gga-att-gtg-agc-gg-3' (SEQ ID NO:314) and 700-049-02 5'-ggt-tga-ctt-cag-agc-ttt- 
gag-3' (SEQ ID NO: 561) with template AfiiFEN (DNA sequence SEQ ID NO:556), and 
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the second by primers: 700-049-03 5'-aaa-gct-ctg-aag-tca-acc-ctg-gag-gag-gcc-ccc-tgg-3' 
(SEQ ID NO: 562) and 390-076-09 5'-taa-tct-gta-tca-ggc-tg-3' (SEQ ID NO:557) with 
template TthAKK GL (DNA sequence SEQ ID NO:2702). The two products were 
combined and amplified with outside primers 390-076-08 and 390-076-09. The 
recombinant PCR product was cut with the restriction enzymes BspEI and Sail and 
hgated into the vector p AfuFEN which was prepared by cutting with the same enzymes 
to yield Afix336-Tth296(AKK) GL construct (DNA sequence SEQ ID NO:2816; amino 
acid sequence SEQ ID NO:2817). 

The procedure for random chimerization desribed above was used to develop the 
following chimeras, 

73. Generation of Random Chimeras S26 and 836 

The templates used to develop these chimeric enzymes were the nuclease domains 
from TthAKK GL, TaqTthAKK GL, TscTthAKK GL and the TfiTthAKK GL constructs. 
Two clones were found to show improvement of activity based on primary activity 
screening. Random chimeras S26 GL (DNA sequence SEQ ID NO:2818; amino acid 
sequence SEQ ID NO:2819) and S36 (DNA sequence SEQ ID NO:2820; amino acid 
sequence SEQ ID NO:2821) were then sequenced and isolated. 

74. Construction of S26(FT) GL 

Site specific mutagenesis was performed on pS26 GL DNA using the mutagenic 
primers: 785-008-03 5'-ttt-acc-cgc-ctc-gag-gtg-ccg-ggc-3' (SEQ ID NO: 489) and 680- 
21-03 5'-cgg-cac-ctc-gag-gcg-ggt-aaa-gcc-caa-aag-gtc-cac-3* (SEQ ED NO: 490) to 
introduce the L107F/E108T mutations and generate S26(FT) GL (DNA sequence SEQ 
ID NO:2822; amino acid sequence SEQ ID NO:2823). 

75. Construction of S36(FT) GL 

Site specific mutagenesis was performed on pS36 GL DNA using the mutagenic 
primers: 785-096-01: 5'-gtg-gac-ctt-ctg-ggc-ttt-acc-cgc-ctc-gag-gcc-ccg-3' (SEQ ID NO: 
486) and 785-096-02: 5'-cgg-ggc-ctc-gag-gcg-ggt-aaa-gcc-cag-aag-gtc-cac-3' (SEQ ID 
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NO: 487) to introduce the L109F/V110T mutations and generate S36(FT) GL (DNA 
sequence SEQ ID NO: 2824; amino acid sequence SEQ ID NO:2825). 

76. Construction of S26(K69E) GL 

5 Site specific mutagenesis was performed on S26 GL DNA using the mutagenic 

primers: 5'-atc-gtg-gtc-ttt-gac-gcc-gag-gcc-ccc-tcc-ttc-c-3' (SEQ ID NO: 492) and 5'- 
gga-agg-agg-ggg-cct-cgg-cgt-caa-aga-cca-cga-t-3' (SEQ ID NO: 493) to introduce the 
K69E mutation and to generate S26(K69E) GL (DNA sequence SEQ ID NO:2826; amino 
acid sequence SEQ ID NO:2827). 

10 

5j 77. Construction of S26(FT/K69E)GL 

O Site specific mutagenesis was performed on S26(FT) GL DNAs using the 

mutagenic primers: 5'-atc-gtg-gtc-ttt-gac-gcc-gag-gcc-ccc-tcc-ttc-c-3 ' (SEQ ID NO: 492) 
S and 5'-gga-agg-agg-ggg-cct-cgg-cgt-caa-aga-cca-cga-t-3' (SEQ ID NO: 493) to introduce 

15 the K69E mutation and to generate S26(FT/K69E) GL (DNA sequence SEQ ID 
p NO:2828; amino acid sequence SEQ ID NO:2829). 

m 

O 78. More random chimeras, N3D7, N1A12 N1C4, and N2C3 

|S The templates used to generate these chimerics were the nuclease domains of 

20 Tth(K69E)AKK GL, Taq(K69E)TthAKK GL, Tsc(K69E)TthAKK GL and 

Tfi(K69E)TthAKK GL constructs. Four clones were shown to have improved activity 
based on the primary activity screening. Random chimeras N3D7 GL (DNA sequence 
SEQ ID NO:2830; amino acid sequence SEQ ED NO:2831), N1A12 GL (DNA sequence 
SEQ ID NO:2832; amino acid sequence SEQ ID NO:2833), N1C4 GL (DNA sequence 
25 SEQ ID NO:2834; amino acid sequence SEQ ID NO:2835) and N2C3 GL (DNA 
sequence SEQ ID NO:2836; amino acid sequence SEQ ID NO:2837) were then 
sequenced and isolated respectively. 

Generation of DL, GR and DR variants of TthAKK GL enzyme 

A series of TthAKK GL variants having a Gly to Asp change at amino 
30 acid 385 (G385D) and/or a Leu to Arg change at amino acid 455 (L455R), were created 
by site-directed mutagenesis. Constructs comprising both of these two amino acid 
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variations is indicated as "DR". Chimeras comprising only one variation are indicated as 
either TDL' or 'GR'. 

1 . Constnictioii of TthAKK DL mutant 

Site specific mutagenesis was performed on pTrc99A TthAKK GL DNA (SEQ 
ID NO: 2702) using the mutagenic primers 1 144-60-01, 5'-cgc-cta-cct-cct-gga-ccc-ttc- 
gaa-cac-cac-c-3' (SEQ ID NO:2858) and 1144-60-02, 5'-ggt-ggt-gtt-cga-agg-gtc-cag-gag- 
gta-ggc-g-3' (SEQ ID NO:2859) to introduce the G383D mutation to yield TthAKK DL 
(DNA sequence SEQ ID NO:2838; amino acid sequence SEQ ID NO:2839). 

2. Construction of TthAKK GR mutant 

Site specific mutagenesis was performed on pTrc99A TthAKK GL DNA (SEQ 
ID NO: 2702) using the mutagenic primers 1 144-60-03, 5'-ggt-acg-gcg-gga-cgt-ggc-cta- 
cct-tca-g-3' (SEQ ID NO:2860) and 1144-60-04, 5'-ctg-aag-gta-ggc-cac-gtc-ccg-ccg-tac- 
c-3' (SEQ ID NO:2861) to introduce the L453R mutation to yield TthAKK GR (DNA 
sequence SEQ ID NO:2840; amino acid sequence SEQ ID NO:2841). 

3. Construction of TthAKK DR mutant 

Site specific mutagenesis was performed on pTrc99A TthAKK GL DNA 
(SEQ ID NO: 2702) using the mutagenic primers 1 144-60-01, 1 144-60-02, 1 144-60-03 
and 1144-60-04 (SEQ ID NOS:2858, 2859, 2860 and 2861, respectively) to introduce the 
G383D and L453R mutations to yield TthAKK DR mutant (DNA sequence SEQ ID 
NO:2842; amino acid sequence SEQ ID NO:2843). 

4. Construction of TthAKK(F29R) GL 

Site specific mutagenesis was performed on pTrc99A TthAKK GL DNA 
(SEQ ID NO: 2702) using the mutagenic primers 1249-26-01, 5'-cac-ctg-gcc-tac-cgc- 
acc-cgc-ttc-gcc-ctg-aag-ggc-ctc-3' (SEQ ID NO:2862) and 1249-26-02, 5'-gag-gcc-ctt- 
cag-ggc-gaa-gcg-ggt-gcg-gta-ggc-cag-gtg-3' (SEQ ID NO:2863) to introduce the F29R 
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mutation to yield the TthAKK(F29R) GL mutant (pNA sequence SEQ E) NO:2844; 
amino acid sequence SEQ JD NO:2845). 

5. Construction of TthAKK(F29R/F30H)GL 

Site specific mutagenesis was performed on pTrc99A TthAKK GL DNA (SEQ 
5 ED NO: 2702) using the mutagenic primers 1249-26-03, 5'-cac-ctg-gcc-tac-cgc-acc-cgc- 
cac-gcc-ctg-aag-ggc-ctc-acc-3' (SEQ ID NO:2864) and 1249-26-04, 5'-ggt-gag-gcc-ctt- 
cag-ggc-gtg-gcg-ggt-gcg-gta-ggc-cag-gtg-3' (SEQ ID NO:2865) to introduce the 
F29R/F30H mutations to yield the TthAKK(F29R/F30H) GL mutant (DNA sequence 
SEQ ID NO:2846; amino acid sequence SEQ ID NO:2847). 

10 

=^ 6. Construction of TthAKK(F29R/F30R)GL 

O Site specific mutagenesis was performed on pTrc99A TthAKK GL DNA (SEQ 

O 

CI ID NO: 2702) using the mutagenic primers 1 249-26-05 , 5 ' -cac-ctg-gcc-tac-cgc-acc-cgc- 
il cgc-gcc-ctg-aag-ggc-ctc-acc-3'(SEQ ID NO:2866) and 1249-26-06, 5 '-ggt-gag-gcc-ctt- 
W 1 5 cag-ggc-gcg-gcg-ggt-gcg-gta-ggc-cag-gtg-3' (SEQ ID NO:2867) to introduce the 
r F29R/F30R mutations to yield the TthAKK(F29R/F30R) GL mutant (DNA sequence 
[| SEQ ID NO:2848; amino acid sequence SEQ ID NO:2849). 

s ^ 
fP 

il 7. Construction of Tsc(F26R)TthAKKGL 

' ^ 20 Site specific mutagenesis was performed on pTrc99A TscTth(AKK) GL DNA 

(SEQ ID NO: 2708) using the mutagenic primers 1249-26-01, 5'-cac-ctg-gcc-tac-cgc- 
acc-cgc-ttc-gcc-ctg-aag-ggc-ctc-3' (SEQ ID NO:2862) and 1249-26-02, 5'-gag-gcc-ctt- 
cag-ggc-gaa-gcg-ggt-gcg-gta-ggc-cag-gtg-3' (SEQ ID NO:2863) to introduce the F26R 
mutation to yield the Tsc(F26R)TthAKK GL mutant (DNA sequence SEQ ID NO:2850; 
25 amino acid sequence SEQ ID NO:285 1). 

8. Construction of Tsc(F26R/F27H)Tth AKK GL 

Site specific mutagenesis was performed on pTrc99A TscTthAKK GL DNA 

(SEQ ID NO: 2708) using the mutagenic primers 1249-26-03, 5'-cac-ctg-gcc-tac-cgc- 
30 acc-cgc-cac-gcc-ctg-aag-ggc-ctc-acc-3' (SEQ ID NO:2864) and 1249-26-04, 5'-ggt-gag- 

gcc-ctt-cag-ggc-gtg-gcg-ggt-gcg-gta-ggc-cag-gtg-3' (SEQ ID NO:2865) to introduce the 
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F26R/F27H mutations to yield the Tsc(F26R/F27H)TthAKK GL mutant (DNA sequence 
SEQ ID NO:2852; amino acid sequence SEQ ID NO:2853). 

9. Construction of Tsc(F26R/F27R)TthAKK GL 

Site specific mutagaiesis was perfonned on pTrc99A TscTthAKK GL DNA 
5 (SEQ ID NO: 2708) using the mutagenic primers 1249-26-05, 5'-cac-ctg-gcc-tac-cgc- 
acc-cgc-cgc-gcc-ctg-aag-ggc-ctc-acc-3' (SEQ ID NO:2866) and 1249-26-06, 5'-ggt-gag- 
gcc-ctt-cag-ggc-gcg-gcg-ggt-gcg-gta-ggc-cag-gtg-3' (SEQ ID NO:2867) to introduce the 
F26R/F27R mutations to yield the Tsc(F26R/F27R)TthAKK GL mutant (DNA sequence 
SEQ ID NO:2854; amino acid sequence SEQ ID NO:2855). 

10 

10. Construction of Taq(F28R)TthAKKGL 

O Site specific mutagenesis was performed on pTrc99A-TaqTthAKK GL DNA 

fi (SEQ ID NO: 2706) using the mutagenic primers 1249-26-03, 5'-cac-ctg-gcc-tac-cgc- 
J acc-cgc-cac-gcc-ctg-aag-ggc-ctc-acc-3' (SEQ ID NO:2864) and 1249-26-04, 5'-ggt-gag- 
y 15 gcc-ctt-cag-ggc-gtg-gcg-ggt-gcg-gta-ggc-cag-gtg-3' (SEQ ID NO:2865) to introduce the 
^ F28R mutation and generate Taq(F28R)TthAKK GL mutant (DNA sequence SEQ ID 
g NO:2856; amino acid sequence SEQ ID NO:2857). 



W 20 EXAMPLE 10 

Test For The Dependence Of An Enzyme On 
The Presence Of An Upstream Oligonucleotide 

When choosing a structure-specific nuclease for use in a sequential invasive 
cleavage reaction it is preferable that the enzyme have little ability to cleave a probe 1) in 

25 the absence of an upstream oUgonucleotide, and 2) in the absence of overlap between the 
upstream oligonucleotide and the downstream probe oligonucleotide. Figs, 26a-e depicts 
several structures that can be used to examine the activity of an enzyme is confronted 
with each of these types of structures. The structure a (Fig. 26a) shows the aUgnment of 
a probe oligonucleotide with a target site on bacteriophage Ml 3 DNA (Ml 3 sequences 

30 shown in Fig. 26 are provided in SEQ ID NO:217) in the absence of an upstream 

oligonucleotide. Structure b (Fig. 26b) is provided with an upstream oligonucleotide that 
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does not contain a region of overlap with the labeled probe (the label is indicated by the 
star). In structures c, d and e (Figs. 26c-e) the upstream oligonucleotides have overlaps 
of 1, 3 or 5 nucleotides, respectively, with the downstream probe oligonucleotide and 
each of these structures represents a suitable invasive cleavage structure. The enzyme 

5 Pfu FEN-1 was tested for activity on each of these structures and all reactions were 
performed in duplicate. 

Each reaction comprised 1 (iM 5' TET labeled probe oligonucleotide 89-15-1 
(SEQ ID NO:212), 50 nM upstream oligonucleotide (either oUgo 81-69-2 [SEQ ID 
NO:216], oligo 81-69-3 [SEQ ID NO:215], oligo 81-69-4 [SEQ ID NO:214], oligo 

10 81-69-5 [SEQ ID NO:213], or no upstream oUgonucleotide), 1 finol M13 target DNA, 10 
mg/ml tRNA and 10 ng oiPfu FEN-1 in 10 |il of 10 mM MOPS (pH 7.5), 7.5 mM MgCb 



O with 0.05% each of Tween 20 and Nonidet P-40 



m 

• -ST 



All of the components except the enzyme and the MgCb were assembled m a 
final volume of 8 |j,l and were overlaid with 1 0 |a,l of CHILLOUT hquid wax. The 



^'^f 1 5 samples were heated to the reaction temperature of 69°C. The reactions were started by 
the addition of the Pfu FEN-1 and MgCl2, in a 2 |al volume. After incubation at 69°C for 



30 minutes, the reactions were stopped with 10 |il of 95% formamide, 10 mM EDTA, 
0.02% methyl violet. Samples were heated to 90°C for 1 min inmiediately before 
electrophoresis through a 20% denaturing acrylamide gel (19:1 cross-linked), with 7 M 

20 urea, in a buffer of 45 mM Tris-Borate, pH 8.3, 1.4 mM EDTA. Gels were then analyzed 
with a FMBIO-100 Hitachi FMBIO fluorescence imager. The resulting image is 
displayed in Fig. 27. 

In Fig. 27, lanes labeled "a" contain the products generated from reactions 
conducted without an upstream oligonucleotide (structure a), lanes labeled "b" contain an 

25 upstream oligonucleotide that does not invade the probe/target duplex (structure b). 

Lanes labeled "c", "d" and "e" contain the products generated from reactions conducted 
using an upstream ohgonucleotide that invades the probe/target duplex by 1, 3 or 5 bases, 
respectively. The size (in nucleotides) of the uncleaved probe and the cleavage products 
is indicated to the left of the image in Fig. 27. Cleavage of the probe was not detectable 

30 when structures a and b were utiUzed. In contrast, cleavage products were generated 

when invasive cleavage structures were utihzed (structures c-e). These data show that the 
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Pfu FEN-1 enzyme requires an overlapping upstream oligonucleotide for specific 
cleavage of the probe. 

Any enzyme may be examined for its suitability for use in a sequential invasive 
cleavage reaction by examining the ability of the test enzyme to cleave structures a-e (it 
5 is understood by those in the art that the specific oligonucleotide sequences shown in 
Figs. 26a-e need not be employed in the test reactions; these structures are merely 
illustrative of suitable test structures). Desirable enzymes display Uttle or no cleavage of 
structures a and b and display specific cleavage of structures c-e (z.e., they generate 
cleavage products of the size expected fi-om the degree of overlap between the two 
10 oligonucleotides employed to form the invasive cleavage structure). 

O EXAMPLE 11 

la Use Of The Products Of A First Invasive Cleavage Reaction To Enable A Second 

Invasive Cleavage Reaction With A Net Gain In Sensitivity 

15 As discussed in the Description of The Invention above, the detection sensitivity 

of the invasive cleavage reaction can be increased by the performing a second round of 
invasive cleavage using the products of the first reaction to complete the cleavage 
structure in the second reaction. In this Example, the use of a probe that, when cleaved in 
a first invasive cleavage reaction, forms an integrated INVADER oligonucleotide and 
20 target molecule for use in a second invasive cleavage reaction, is illustrated. 

A first probe was designed to contain some internal complementarity so that when 
cleaved in a first invasive cleavage reaction the product ("Cut Probe 1 ") could form a 
target strmd comprising an integral INVADER oligonucleotide. A second probe was 
provided in the reaction that would be cleaved at the intended site when hybridized to the 
25 newly formed target/INVADER molecule. To demonstrate the gain in signal due to the 
performance of sequential invasive cleavages, a standard invasive cleavage assay, as 
described above, was performed in parallel. 

All reactions were performed in duplicate. Each standard non-sequential) 
invasive cleavage reaction comprised 1 |a,M 5' fluorescein-labeled probe oligo 073-182 (5* 
30 Fl-AGAAAGGAAGGGAAGAAAGCGAA-3'; SEQ ID NO:591), 10 nM upstream oligo 



I '-ST 
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81-69-4 (5'-CTTGACGGGGAAAGCCGGCGAACGTGGCGA-3'; SEQ ID NO:214), 10 
to 100 attomoles of M13 target DNA, 10 mg/ml tRNA and 10 ng oiPJu FEN-1 in 10 lal 
of 10 mM MOPS (pH 7.5), 8 mM MgCl2 with 0.05% each of Tween 20 and Nonidet 
P-40. All of the components except the enzyme and the MgCli were assembled in a 
5 volume of 7 (il and were overlaid with 1 0 |ul of CHILLOUT liquid wax. The samples 
were heated to the reaction temperature of 62°C. The reactions were started by the 
addition of the Pfu FEN-1 and MgCb, in a 2 ^1 volume. After incubation at 62°C for 30 
minutes, the reactions were stopped with 10 |4,1 of 95% formamide, 10 mM EDTA, 0.02% 
methyl violet. 

10 Each sequential invasive cleavage reaction comprised 1 |j,M 5' fluorescein-labeled 

Q ohgonucleotide 073-191 (the furst probe or "Probe 1 ", 5' Fl-TGGAGGTCAAAACATCG 
ij ATAAGTCGAAGAAAGGAAGGGAAGAAAT-3'; SEQ ID NO:592), 10 nM upstream 
i oligonucleotide 81-69-4 (5'- CTTGACGGGGAAA GCCGGCGAACGTGGCGA-3'; 
y SEQ ID NO:214), 1 \xM. of 5' fluorescein labeled oligonucleotide 106-32 (the second 

^ 15 probe or "Probe 2", 5' Fl-TGTTTTGACCT CCA-3'; SEQ ID NO:593), 1 to 100 amol of 

s 

0 M13 target DNA, 10 mg/ml tRNA and 10 ng oiPfu FEN-1 in 10 ^1 of 10 mM MOPS 
fl (pH 7.5), 8 mM MgCl2 with 0.05% each of Tween 20 and Nonidet P-40. All of tiie 
5^ components except the enzyme and the MgCl2 were assembled in a volimie of 8 |al and 
^ were overlaid with 10 |iil of CHILLOUT liquid wax. The samples were heated to the 
20 reaction temperature of 62^C (this temperature is the optimum temperature for annealing 
of Probe 1 to the first target). The reactions were started by the addition of Pfu FEN-1 
and MgCl2, in a 2 ^1 volume. After incubation at 62°C for 15 minutes, the temperature 
was lowered to 58°C (this temperature is the optimum temperature for annealing of Probe 
2 to the second target) and the samples were incubated for another 15 min. Reactions 
25 were stopped by the addition of 10 |il of 95% formamide, 20 mM EDTA, 0.02% methyl 
violet. 

Samples firom both the standard and the sequential invasive cleavage reactions 
were heated to 90° C for 1 min immediately before electrophoresis through a 20% 
denaturing acrylamide gel (19:1 cross-linked), with 7 M urea, in a buffer of 45 mM 
30 Tris-Borate, pH 8.3, 1 .4 mM EDTA. The gel was then analyzed with a Molecular 
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Dynamics Fluorlmager 595. The resulting image is displayed in Fig. 28a. A graph 
showing measure of fluorescence intensity for each of the product bands is shown in Fig. 
28b. 

In Fig. 28a, lanes 1-5 contain the products generated in standard invasive cleavage 
5 reactions that contained either no target (lane 1), 10 amol of target (lanes 2 and 3) or with 
100 amol of target (lanes 4 and 5). The uncleaved probe is seen as a dark band in each 
lane about halfway down the panel and the cleavage products appear as a smaller black 
band near the bottom of the panel, the position of the cleavage product is indicated by an 
arrow head to the left of Fig. 28a. The gray ladder of bands seen in lanes 1-5 is due to the 
10 thermal degradation of the probe and is not related to the presence or absence of the 
target DNA. The remaining lanes display products generated in sequential invasive 
cleavage reactions that contained 1 amol of target (lanes 6 and 7), 10 amol of target 
(lanes 8 and 9) and 100 amol of target (lanes 10 and 11). The uncleaved first probe 
(Probe 1; labeled "1 uncut") is seen near the top of the panel, while the cleaved first probe 
15 is indicated as " 1 : cut". Similarly, the imcleaved and cleaved second probe are indicated 

w 

iH as "2: uncut" and 2: cut," respectively. 

JL^ The graph shown in Fig. 28b compares the amoimt of product generated from the 

fl standard reaction ("Series 1") to the amount of product generated Scorn the second step of 

the sequential reaction ("Series 2"). The level of background fluorescence measured 
hi 20 fi:om a reaction that lacked target DNA was subtracted from each measurement. It can be 
seen from the table located below the graph that the signal from the standard mvasive 
cleavage assay that contained 100 attomoles of target DNA was nearly identical to the 
signal from the sequential invasive cleavage assay in which 1 attomole of target was 
present, indicating that the inclusion of a second cleavage structure increases the 
25 sensitivity of the assay 100 to 200-fold. This boost in signal allows easy detection of 

target nucleic acids at the sub attomole level using the sequential invasive cleavage assay, 
while the standard assay, when performed using this enzyme for only 30 minutes, does 
not generate detectable product in the presence of 10 attomoles of target. 

When the amount of target was decreased by 10 or 100 fold in the sequential 
30 invasive cleavage assay, the intensity of the signal was decreased by the same proportion. 
This indicates that the quantitative capability of the invasive cleavage assay is retained 
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even when reactions are performed in series, thus providing a nucleic acid detection 
method that is both sensitive and quantitative. 

While in this Example, the two probes used had different optimal hybridization 
temperatures (i.e,, the temperature empiracally determined to give the greatest turnover 
rate in the given reaction conditions), the probes may also be selected (Le,, designed) to 
have the same optimal hybridization temperature so that a temperature shift during 
incubation is not necessary. 

EXAMPLE 12 

The Products Of A Completed Sequential Invasive Cleavage Reaction Cannot Cross 
Contaminate Subsequent Similar Reactions 

As discussed in the Description of the Invention, the serial nature of the multiple 
invasive cleavage events that occur in the sequential invasive cleavage reaction, in 
contrast to the reciprocating nature of the polymerase chain reaction and similar doubling 
assays, means that the sequential invasive cleavage reaction is not subject to 
contamination by the products of like reactions because the products of the first cleavage 
reaction do not participate in the generation of new signal in the second cleavage 
reaction. If a large amount of a completed reaction were to be added to a newly 
assembled reaction, the background that would be produced would come from the 
amount of target that was also carried in, combined with the amount of already-cleaved 
probe that was carried in. In this Example, it is demonstrated that a very large portion of 
a primary reaction must be introduced into the secondary reaction to create significant 
signal. 

A first or primary sequential invasive cleavage reaction was performed as 
described above using 100 amol of target DNA. A second set of 5 reactions were 
assembled as described in Ex. 1 1 with the exception that portions of the first reaction 
were introduced and no additional target DNA was included. These secondary reactions 
were initiated and incubated as described above, and included 0, 0.01, 0.1, 1, or 10% of 
the first reaction material. A control reaction including 100 amol of target was included 
in the second set also. The reactions were stopped, resolved by electrophoresis and 
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visualized as described above, and the resulting image is displayed in Fig. 29. The 
primary probe, uncut second probe and the cut 2nd probe are indicated on the left as " 1 : 
cut", 2; uncut" and 2: cut", respectively. 

In Fig. 29, lane 1 shows the results of the first reaction with the accumulated 
product at the bottom of the panel, and lane 2 show a 1:10 dilution of the same reaction, 
to demonstrate the level of signal that could be expected from that level of contamination, 
without further amplification. Lanes 3 through 7 show the results of the secondary 
cleavage reactions that contained 0, 10, 1, 0,1 or 0.01% of the first reaction material 
added as contaminant, respectively and lane 8 shows a control reaction that had 100 amol 
of target DNA added to verify the activity of the system in the secondary reaction. The 
signal level in lane 4 is as would be expected when 10% of the pre-cleaved material is 
transferred (as in lane 2) and 10% of the transferred target material from the lane 1 
reaction is allowed to further amphfy. At all levels of further dilution the signal is not 
readily distinguished from background. These data demonstrate that while a large-scale 
transfer from one reaction to another may be detectable, cross contamination by the 
minute quantities that would be expected from aerosol or from equipment contamination 
would not be easily mistaken for a false positive result. These data also demonstrate that 
when the products of one reaction are deliberately carried over into a fresh sample, these 
products do not participate in the new reaction, and thus do not affect the level of target- 
dependent signal that may be generated in that reaction. 



EXAMPLE 13 

Detection of Human Cytomegalovirus Viral DNA by Invasive Cleavage 
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The previous Example demonstrates the abiUty of the invasive cleavage reaction 
to detect minute quantities of viral DNA m the presence of human genomic DNA. In this 
Example, the probe and INVADER oUgonucleotides were designed to target the 
3104-3061 region of the major immediate early gene of human cytomegalovirus 
5 (HCMV) as shown in Fig. 30. In Fig. 30, the INVADER ohgo (89-44; SEQ ID NO:219) 
and the fluorescein (Fl)-labeled probe ohgo (89-76; SEQ ID NO:218) are shown annealed 
along a region of the HCMV genome corresponding to nucleotides 3057-3 110 of the viral 
DNA (SEQ ID NO:220). The probe used in this Example is a poly-pyrimidine probe and 
as shown herein the use of a poly-pyrimidine probe reduces background signal generated 
10 by the thermal breakage of probe oligos. 

The genomic viral DNA was purchased from Advanced Biotechnologies, 

: a 
jssss 

Q Incorporated (Columbia, MD), The DNA was estimated (but not certified) by personnel 
2; at Advanced Biotechnologies to be at a concentration of 170 amol (1x10 copies) per 
microliter. The reactions were performed in quadruphcate. Each reaction comprised 1 
5 15 |J,M 5" fluorescein labeled probe oligonucleotide 89-76 (SEQ ID N0:21 8), 1 00 nM 

INVADER oligonucleotide 89-44 (SEQ ID N0:2 1 9), 1 ng/ml human genomic DNA, and 
9 one of five concentrations of target HCMV DNA in the amounts indicated above each 
ll lane in Fig. 31, and 10 ng of Pfu FEN-1 in 10 ^il of 10 mM MOPS (pH 7.5), 6 mM MgCb 
£ with 0.05% each of Tween 20 and Nonidet P-40. All of the components except the 
n 20 labeled probe, enzyme and MgCh were assembled in a final volume of 7 \il and were 
overlaid with 10 \xl of CHILLOUT Uquid wax. The samples were heated to 95^C for 5 
min, then reduced to 62°C. The reactions were started by the addition of probe, Pfu 
FEN-1 and MgCb, in a 3 |al volume. After incubation at 62°C for 60 minutes, the 
reactions were stopped with 10 |xl of 95% formamide, 10 mM EDTA, 0.02% methyl 
25 violet. Samples were heated to 90° C for 1 min immediately before electrophoresis 

through a 20% acrylamide gel (19:1 cross-linked), with 7 M urea, in a buffer of 45 mM 
Tris-Borate, pH 8.3, 1 .4 mM EDTA. Gels were then analyzed with a Molecular 
Dynamics Fluorlmager 595. 

The resulting image is displayed in Fig. 3 1 . The replicate reactions were nm in 
30 groups of four lanes with the target HCMV DNA content of the reactions indicated above 
each set of lanes (0-170 amol). The uncleaved probe is seen in the upper third of the 
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panel ("Uncut 89-76") while the cleavage products are seen in the lower two-thirds of the 
panel ("Cut 89-76"). It can be seen that the intensity of the accumulated cleavage product 
is proportional to the amount of the target DNA in the reaction. Furthermore, it can be 
clearly seen in reactions that did not contain target DNA ("no target") that the probe is 
5 not cleaved, even in a background of human genomic DNA. While 10 ng of human 
genomic DNA was included in each of the reactions shown in Fig. 31, inclusion of 
genomic DNA up to 200 ng has shght impact on the amount of product accumulated. 
The data did not suggest that 200 ng per 1 0 |il of reaction mixture represented the 
maximum amount of genomic DNA that could be tolerated without a significant 
10 reduction in signal accumulation. For reference, this amount of DNA exceeds what 
f^^ might be found in 0.2 ml of urine (a commonly tested amount for HCMV in neonates) 
S and is equivalent to the amount that would be found in about 5 jal of whole blood, 
to These results demonstrate that the standard (i.e., non-sequential) invasive 

1^ cleavage reaction is a sensitive, specific and reproducible means of detecting viral DNA. 

1 5 Detection of 1 .7 amol of target is roughly equivalent to detection of 1 0^ copies of the 
. virus. This is equivalent to the number of viral genomes that might be found in 0.2 mis 

fl of urine from a congenitally infected neonate (10^ to 10^ genome equivalents per 0.2 mis; 
j«| Stagno et al, J. Infect. Dis., 132:568 [1975]). Use of the sequential invasive cleavage 
O assay would permit detection of even fewer viral DNA molecules, facilitating detection 

' ' 20 in blood (10^ to 10^ viral particles per ml; Pector et al, J. Clin. Microbiol, 30:2359 
[1992]), which carries a much larger amount of heterologous DNA. 

From the above it is clear that the invention provides reagents and methods to 
permit the detection and characterization of nucleic acid sequences and variations in 
nucleic acid sequences. The INVADER-directed cleavage reaction and the sequential 
25 INVADER-directed cleavage reaction of the present invention provide ideal direct 

detection methods that combine the advantages of the direct detection assays (e.g., easy 
quantification and minimal risk of carry-over contamination) with the specificity 
provided by a dual or tri oligonucleotide hybridization assay. 

As indicated in the Description of the Invention, the use of sequential invasive 
30 cleavage reactions can present the problem of residual uncut fu-st, or primary, probe 
interacting with the secondary target, and either competing with the cut probe for 
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binding, or creating background through low level cleavage of the resulting structure. 
This is shown diagramatically in Figs. 32 and 33. In Fig. 32, the reaction depicted makes 
use of the cleavage product from the fnst cleavage structure to form an INVADER 
oligonucleotide for a second cleavage reaction. The structure formed between the 
5 secondary target, the secondary probe and the uncut primary probe is depicted in Fig. 32, 
as the right hand structure shown in step 2a. This structure is recognized and cleaved by 
the 5' nucleases, albeit very inefficiently (z.a, at less than about 1% in most reaction 
conditions). Nonetheless, the resulting product is indistinguishable from the specific 
product, and thus may lead to a false positive result. The same effect can occur when the 
1 0 cleaved primary probe creates and integrated INVADER/target (IT) molecule, as 
described in Example 1 1; the formation of the imdesirable complex is depicted 
schematically in Fig. 33, as the right hand structure shown in step 2a. 

O The improvements provided by the inclusion of ARRESTOR oUgonucleotides of 

til 

,;r: various compositions in each of these types of sequential INVADER assays are 

15 demonstrated in the following Examples. These ARRESTOR oligonucleotides are 



configured to bind the residual uncut probe from the first cleavage reaction in the series, 
thereby increasing the efficacy of and reducing the non-specific background in the 
subsequent reaction(s). 



In this Example, the effect of including an ARRESTOR oligonucleotide on the 
generation of signal using the IT probe system 33 is demonstrated. The ARRESTOR 

25 oligonucleotide hybridizes to the primary probe, mainly in the portion that recognizes the 
target nucleic acid during the first cleavage reaction. In addition to examining the effects 
of adding an ARRESTOR oligonucleotide, the effects of using ARRESTOR 
oligonucleotides that extended in complementarity different distances into the region of 
the primary probe that composes the secondary IT structure were also investigated. 

30 These effects were compared in reactions that included the target DNA over a range of 




EXAMPLE 14 



ARRESTOR" Oligonucleotides Improve Sensitivity of Multiple Sequential 

Invasive Cleavage Assays 
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concentrations, or that lacked target DNA, in order to demonstrate the level of 
nonspecific (i.e., not related to target nucleic acid) background in each set of reaction 
conditions. 

The target DNA for these reactions was a fi-agment that comprised the full length 
5 of the hepatitis B genome from strain of serotype adw. This material was created using 
the polymerase chain reaction from plasmid pAM6 (ATCC #45020D). The PCRs were 
conducted using a vector-based forward primer, oligo # 156-022-001 (5'- 
ggcgaccacacccgtcctgt-3'; SEQ ID NO:594) and a reverse primer, oligo #156-022-02 (5'- 
ccacgatgcgtccggcgtag-3'; SEQ ID NO:595) to amplify the full length of the HBV insert, 
10 an amplicon of about 3.2kb. The cycling conditions included a denatuxation of the 
plasmid at 95°C for 5 minutes, followed by 30 cycles of 95°C, 30 seconds; 60''C, 40 
[J seconds; and 72°C, 4 minutes. This was followed by a final extension at 72°C for 10 
|M minutes. The resulting ampHcon, termed pAM6#2, was adjusted to 2 M NH4OAC, and 
Si collected by precipitation wiht isopropanol. After drying in vacuo, DNA was dissolved 
^[15 in 1 0 mM Tris pH ,0, 0.1 mM EDTA, The concentration was determined by OD200 

^ measurement, and by INVADER assay with comparison to a standard of known 

Q 

S concentration. 

[i The INVADER reactions were conducted as follows. Five master mixes, termed 

g „g„ M HQ rr I, ^g^g assembled; all mixes contained 12.5 mM MOPS, pH 

20 7.5, 500 finoles primary INVADER oligo #218-55-05 (SEQ ID NO:596), 10 ng human 
genomic DNA (Novagen) and 30 ng AfuFENl enzyme, for every 8 ^il of mix. Mix A 
contained no added HBV genomic amplicon DNA; mix B contained 600 molecules of 
HBV genomic amplicon DNA pAM6 #2; mix C contained 6,000 molecules pAM6 #2; 
mix D contained 60,000 molecules pAM6 #2; and mix E contained 600,000 molecules 
25 pAM6 #2. The mixes were aliquotted to the reaction tubes, 8 |Lil/tube: mix A to tubes 1, 
2, 11, 12, 21 and 22; mix B to tubes 3, 4, 13, 14, 23 and 24; mix C to tubes 5, 6, 15, 16, 
25 and 26; mix D to tubes 7, 8, 17, 18, 27 and 28; and mix E to tubes 9, 10, 19, 20, 29 
and 30. The samples were incubated at 95°C for 4 minutes to denature the HBV genomic 
amplicon DNA. The reactions were then cooled to 67°C, and 2 |al of a mix containing 
30 37.5 mM MgCh and 2.5 pmoles 218-95-06 (SEQ ID NO:597) for every 2 j^l, was added 
to each sample. The samples were incubated at 67^C for 60 minutes. Three secondary 
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reaction master mixes were prepared, all mixes contained 10 pmoles of secondary probe 
oligonucleotide #228-48-04 (SEQ ID NO:598) for every 2 ^1 of mix. Mix 2A contained 
no additional oligonucleotide, mix 2B contained Spmoles "ARRESTOR" oligo # 218-95- 
03 (SEQ ID NO:599) and mix 2C contained 5 pmoles of "ARRESTOR" oligo # 218-95- 
01 (SEQ ID NO:600). After the 60 minute incubation at 67°C (the primary reaction 
described above), 2 \i\ of the secondary reaction mix was added to each sample: Mix 2 A 
was added to samples #1-10; Mix 2B was added to samples #1 1-20; and Mix 2C was 
added to samples #21-30. The temperature was adjusted to 52°C and the samples were 
incubated for 30 minutes at 52°C. The reactions were llien stopped by the addition of 10 
Hl of a solution of 95% formamide, 5 mM EDTA and 0.02% crystal violet. All samples 
were heated to 95°C for 2 minutes, and 4 |j,l of each sample were resolved by 
electrophoresis through 20% denaturing acrylamide gel (19:1 cross-linked) with 7 M 
urea, in a buffer containing 45 mM Tris-Borate (pH8.3) and 1.4 mM EDTA. The results 
were imaged using the Molecular Dynamics Fluoroimager 595, excitation 488, emission 
530. The resulting images are shown in Fig. 34. 

In Fig. 34, Panel A shows the results of the target titration when no ARRESTOR 
oligonucleotide was included in the secondary reaction; Panel B shows the results of the 
same target titration using an ARRESTOR oligonucleotide that extended 2 nt into the 
non-target complementary region of the primary probe; and Panel C shows the results of 
the same target titration using an ARRESTOR oUgonucleotide that extended 4 nt into the 
non-target complementary region of the primary probe. The product of the secondary 
cleavage reaction is seen as a band near the bottom of each panel. The first two lanes of 
each panel (i.e., 1 and 2, 1 1 and 12, 21 and 22) lacked target DNA, and the signal that co- 
migrates with the product band represents the nonspecific background under each set of 
conditions. 

It can be seen by visual inspection of these panels that the backgroimd signal is 
both reduced, and made more predictable, by the inclusion of either species of 
ARRESTOR oUgonucleotide. Li addition to reducing the background in the no-target 
control lanes, the background reduction in the reactions that had the more dilute amounts 
of target included is reduced, leading to a signal that is a more accurate reflection of the 
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target contained within the reaction, thus improving the quantitative range of the 
multiple, sequential invasive cleavage reaction. 

To quantify the impact of including the ARRESTOR oligonucleotide in the 
secondary cleavage reaction under these conditions, the average product band signal from 
5 the reactions having the largest amount of target {i.e., averages of the signals from lanes 9 
and 10, lanes 19 and 20, and lanes 29 and 30), were compared to the averaged signal 
from the no-target contol lanes for each panel, determine the "fold over background," the 
factor of signal amplification over backgroxmd, under each set of conditions. For the 
reactions without the ARRESTOR oUgonucleotide, Panel A, the fold over background 
10 was 5.3; for Panel B, the fold over background was 12.7; and for Panel C, the fold over 
background was 13.4, indicating that in this system inclusion of any ARRESTOR 
2 oUgonucleotide at least doubled the specificity of the signal over the ARRESTOR 
\M oUgonucleotide -less reactions, and that the ARRESTOR oligonucleotide that extended 
j5 sUghtly farther into the non-target complementary region may be sUghtly more effective, 
15 at least in this embodiment of the system. This clearly shows the benefits of using an 

^ ARRESTOR oligonucleotide to enhance the specificity of these reactions, an advmtage 

PI 

PI that is of particular benefit at low levels of target nucleic acid. 

j| EXAMPLE 15 

20 "ARRESTOR" Oligonucleotides Allow use of Higher Concentrations of Primary 

Probe Without Increasing Background Signal 

hicreasing the concentration of the probe in the invasive cleavage reaction can 
dramatically increase the amount of signal generated for a given amount of target DNA. 
While not intending to limit the explanation to any specific mechanism, this is beUeved to 

25 be caused by the fact that increased concentration of probe increases the rate at which the 
cleaved probe is supplanted by an uncleaved copy, thereby increasing the apparent 
tumover rate of the cleavage reaction. Unfortunately, this effect could not heretofore be 
appUed in the primary cleavage reaction of a multiple sequential INVADER assay 
because the residual uncleaved primary probe can hybridize to the secondary target, in 

30 competition with the cleaved molecules, thereby reducing the efficacy of the secondary 
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reaction. Elevated concentrations of primary probe exacerbate this problem. Further, the 
resulting complexes, as described above, can be cleaved at a low level, contributing to 
background. Therefore, increasing the primary probe can have the double negative effect 
of both slowing the secondary reaction and increasing the level of this form of non target- 
specific background. The use of an ARRESTOR oligonucleotide to sequester or 
neutralize the residual primary probe allows this concentration-enhancing effect to be 
applied to these sequential reactions. 

To demonstrate this effect, two sets of reactions were conducted. In the furst set 
of reactions, the reactions were conducted using a raQge of primary probe concentrations, 
but no ARRESTOR ohgonucleotide was supplied in the secondary reaction. In the 
second set of reactions, the same probe concentrations were used, but an ARRESTOR 
oligonucleotide was added for the secondary reactions. 

All reactions were performed in duphcate. Primary INVADER reactions were 
done in a final volume of 10 jil and contained: 10 mM MOPS, pH 7.5, 7.5 mM MgCk, 
500 fin of primary INVADER (218-55-05; SEQ ID NO:596); 30 ng of AfiiFENl enzyme 
and 10 ng of human genomic DNA. 100 zeptomoles of HBV pAM6 #2 amplicon was 
included in all even numbered reactions (by reference to Figs. 35 A and B). Reactions 
included 10 pmoles, 20 pmoles, 50 pmoles, 100 pmoles or 150 pmoles of primary probe 
(218-55-02; SEQ ID NO:601). MOPS, target and INVADER ohgonucleotides were 
combined to a final volume of 7 ^il. Samples were heat denatured at 95*^C for 5 minutes, 
then cooled to 67''C. During the 5 minute denaturation, MgCl2, probe and enzyme were 
combined. The primary INVADER reactions were initiated by the addition of 3 fxl of 
MgCl2, probe and enzyme mix, to the final concentrations indicated above. Reactions 
were incubated for 30 minutes at 67*^C. The reactions were then cooled to 52°C, and 
each primary INVADER reaction received the following secondary reaction components 
in a total volume of 4 jxl: 2.5 pmoles secondary target (oligo number 218-95-04; SEQ ID 
NO:602); 10 pmoles secondary probe (oHgo number 228-48-04; SEQ ID NO:598). The 
reactions that included the ARRESTOR oligonucleotide had either 40 pmoles, 80 pmoles, 
200 pmoles, 400 pmoles or 600 pmoles of ARRESTOR ohgonucleotide (ohgo number 
218-95-01; SEQ ID NO:600), added at a 4-fold molar excess over the primary probe 
amount for each reaction, with this mix. Reactions were then incubated at 52°C for 30 
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minutes. The reactions were stopped by the addition of 10 |li1 of a solution of 95% 
formamide, 10 mM EDTA aad 0.02% crystal violet. All samples were heated to 95®C 
for 1 minute, and 4 |j,l of each sample were resolved by electrophoresis through 20% 
denaturing acrylamide gel (19:1 cross-linked) with 7 M urea, in a buffer containing 45 
5 mM Tris-Borate (pH8,3) and 1 .4 mM EDTA. The results were imaged using the 
Molecular Dynamics Fluoroimager 595, excitation 488, emission 530. The resulting 
images for the reactions either without or with an ARRESTOR oUgonucleotide are shown 
in Figs. 35A and 35B, respectively. The products of cleavage of the secondary probe are 
seen as a band near the bottom of each panel 
10 In Fig. 35A, lane sets 1 and 2 show results with 10 pmoles of primary probe; 3 

and 4 had 20 pmoles; 5 and 6 had 50 pmoles; 7 and 8 had 100 pmoles; and 9 and 10 had 
H 150 pmoles. It can be seen by visual examination, that the increases in the amount of 
15 primary probe have the combined effect of slightly increasing the background in the no- 
•^J target lanes (odd numbers) while reducing the specific signal in the presence of target 

i^l 1 5 (even numbered lanes), and therefore the reducing the specificity of the reaction if 

W 

viewed as the measure of "fold over background," demonstrating that the approach of 
increasing signal by increasing probe cannot be applied in these sequential reactions. 

fl In Fig. 35B, lane sets 1 and 2 show results with 10 pmoles of primary probe; 

while 3 and 4 had 20 pmoles; 5 and 6 had 50 pmoles; 7 and 8 had 100 pmoles; and 9 and 

Q 20 10 had 150 pmoles. In addition, each reaction included 4-fbld molar excess of the 

ARRESTOR oligonucleotide added before the secondary cleavage reaction. It can be 
seen by visual examination that the background in the no-target lanes (odd numbers) is 
lower in all cases, while the specific signal in the presence of target (even numbered 
lanes) increases with increased amounts of primary probe, leading to a greater "fold over 
25 background" sensitivity at this target level. 

To quantitatively compare these effects, the fluorescence signal firom the products 
of both non-specific and specific cleavage were measured. The results are depicted 
graphically in Fig. 35C, graphed as a measure of the percentage of the secondary probe 
cleaved during the reaction, compared to the amount of primary probe used. 
30 Examination of the plots fi-om the no-target reactions confirms that the background in the 
absence of the ARRESTOR oligonucleotide is, in general, roughly two-fold higher, and 
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that both increase shghtly with the increasing probe amounts. The specific signals 
however, diverge between the two sets of reaction more dramatically. While the signal in 
the no-ARRESTOR oUgonucleotide reactions decreases steadily as primary probe was 
increased, the signal in the ARRESTOR oligonucleotide reactions continued to increase. 
At the highest primary probe concentrations tested, the no-ARRESTOR oligonucleotide 
reactions had specific signal that was only 1.7 fold over background, while the 
ARRESTOR ohgonucleotide reactions detected the 100 zmoles (60,000 copies) of target 
with a signal 6.5 fold over background, thus demonstrating the improvement in the 
sequential invasive cleavage reaction when an ARRESTOR oUgonucleotide is included. 

EXAMPLE 16 

Modified Backbones Improve Performance of ARRESTOR Oligonucleotides All 
Natural "ARRESTOR" Oligo With No 3'-Amine 

The reactions described in the previous two Examples used ARRESTOR 
oligonucleotides that were constructed using T 0-methyl ribose backbone, and that 
included a positively charged amine group on the 3' terminal nucleotide. The 
modifications were made specifically to reduce enzyme interaction with the primary 
probe/ARRESTOR oligonucleotide complex. During the development of the present 
invention, it was determined that the T 0-methy modified oligonucleotides are somewhat 
resistant to cleavage by the 5' nucleases, just as they are slowly degraded by nucleases 
when used in antisense applications {^ee e.g., Kawasaki et ah, J. Med. Chem., 36:831 
[1993]). 

The presence of an amino group on the 3' end of an oligonucleotide reduces its 
ability to direct invasive cleavage. To reduce the possibility that the ARRESTOR 
oligonucleotide would form a cleavage structure in this way, an amino group was 
included in the design of the experiments described in this and other Examples. 

Initial designs of the ARRESTOR oligonucleotides (sometimes referred to as 
"blockers") did not include these modifications, and these molecules were found to 
provide no benefit in reducing background cleavage in the sequential invasive cleavage 
assay and, in fact, sometimes contributed to background by inducing cleavage at an 
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unanticipated site, presumably by providing some element to an alternative cleavage 
structure. The effects of natural and modified ARRESTOR oligonucleotides on the 
background noise in these reactions are examined in this Example. 

The efficacy of an "all-natural ARRESTOR oUgonucleotide (i.e„ an ARRESTOR 
5 oligonucleotide that did not contain any base analogs or modifications) was examined by 
comparison to an identical reactions that lacked ARRESTOR oUgonucleotide, All 
reactions were performed in duplicate, and were conducted as follows. Two master 
mixes were assembled, each containing 12,5 mM MOPS, pH 7.5, 500 finoles primary 
INVADER oligonucleotide #218-55-05 (SEQ ID NO:596), 10 ng human genomic DNA 
10 (Novagen) and 30 ng AfuFENl enzyme for every 8 |li1 of mix. Mix A contained no 
added HBV genomic amplicon DNA, mix B contained 600,000 molecules of HBV 
P genomic amplicon DNA, pAM6 #2. The mixes were distributed to the reaction tubes, in 
i5j aliquots of 8 |il/tube as follows: mix A to tubes 1, 2, 5 and 6; and mix B to tubes 3, 4, 7 
SJ ^d 8. The samples were incubated at 95°C for 4 minutes to denature the HBV genomic 
yf 15 ampUcon DNA. The reactions were then cooled to 67°C and 2ul of a mix containing 37.5 
9 mM MgCl2 and 1 0 pmoles 2 1 8-55-02B (SEQ ID NO:603) for every 2 ^1, was added to 

fl each sample. The samples were then incubated at 67''C for 30 minutes. Two secondary 
reaction master mixes were prepared, each containing 10 pmoles of secondary probe 

\M 

0 oUgo #228-48-04N (SEQ ID NO:604) and 2.5 pmoles of secondary target ohgonucleotide 

fm 

' " 20 #21 8-95-04 (SEQ ID NO:602) for every 3 ^il of mix. Mix 2A contained no additional 
ohgonucleotide, while mix 2B contained 50 pmoles of the natural "ARRESTOR" 
ohgonucleotide #241-62-02 (SEQ ID NO:605), After the initial 30 minute incubation at 
67''C, the temperature was adjusted to 52°C, and 3 |li1 of a secondary reaction mix was 
added to each sample, as follows: Mix 2A was added to samples #1-4; and Mix 2B was 

25 added to samples #5-8. The samples were then incubated for 30 minutes at 52°C. The 
reactions were then stopped by the addition of 10 |j,l of a solution of 95% formamide, 10 
mM EDTA and 0.02% crystal violet. 

All of the samples were heated to 95^C for 2 minutes, and 4 ixl of each sample 
were resolved by electrophoresis through a 20% denaturing acrylamide gel (19:1 cross- 

30 hnked) with 7 M urea, in a buffer containing 45 mM Tris-Borate (pH8.3) and 1 .4 mM 
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EDTA. The results were imaged using the Molecular Dynamics Fluoroimager 595, 
excitation 488, emission 530. The resulting image is shown in Fig. 36A. 

To compare the effects of the various modifications made to the ARRESTOR 
oligonucleotides, reactions were performed using ARRESTOR oligonucleotides having 
5 all natural bases, but including a 3' terminal amine; ARRESTOR oligonucleotides having 
a 3' portion composed of 2' 0-methyl nucleotides, plus the 3' terminal amine; and 
ARRESTOR oligonucleotides composed entirely of T 0-methyl nucleotides, plus the 3' 
terminal amine. These were compared to reactions performed without an ARRESTOR 
oligonucleotide. The reactions were conducted as follows. Two master mixes were 
10 assembled, all mixes contained 14.3 mM MOPS, pH 7.5, 500 finoles primary INVADER 
oligo #218-55-05 (SEQ ID NO:596) and 10 ng human genomic DNA (Novagen) for 
j!:: every 7 p,! of mix. Mix A contained no added HBV genomic amplicon DNA, mix B 
O contained 600,000 molecules of HBV genomic amphcon DNA, pAM6 #2. The mixes 
J? were distributed to the reaction tubes, at 7 p,l/tube: mix A to tubes 1, 2, 5, 6, 9, 10, 13 and 

15 14; and mix B to tubes 3, 4, 7, 8, 1 1, 12, 15 and 16. The samples were warmed to 95°C 
^ for 4 minutes to denature the HBV DNA. The reactions were then cooled to 67*^0 and 3 

0 )x\ of a mix containing 25 mM MgCla, 25 pmoles 218-55-02B (SEQ ID NO:603) and 30 
J=fl ng AfuFENl enzyme per 3 |al, were added to each santrple. The samples were then 

1 J incubated at 67°C for 30 minutes. Four secondary reaction master mixes were prepared; 

20 all mixes contained 10 pmoles of secondary probe oUgonucleotide #228-48-04B (SEQ ID 
NO:606) and 2.5 pmoles of secondary target oHgonucleotide #218-95-04 (SEQ ID 
NO:602) for every 3 |al of mix. Mix 2A contained no additional oligonucleotide, while 
mix 2B contained 100 pmoles of the natural+amine ARRESTOR ohgonucleotide #241- 
62-01 (SEQ ID NO:607), mix 2C contained 100 pmoles of partially 0-methyl+amine 

25 oUgonucleotide # 241-62-03 (SEQ ID NO:608) and mix 2D contained 100 pmoles of all 
0-methyl+amine oligonucleotide #241-64-01 (SEQ ID NO:609). After the initial 30 
minute incubation at 67°C, the temperature was adjusted to 52'^C and 3 |li1 of a secondary 
reaction mix was added to each sample, as follows: mix 2A was added to samples #1-4; 
mix 2B was added to samples #5-8; mix 2C was added to samples #9-12; and mix 2D 

30 was added to samples #13-16. The samples were incubated for 30 minutes at 52°C, then 
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stopped by the addition of 10 of a solution of 95% formamide, 10 mM NaEDTA, and 
0.2% crystal violet. 

All samples were heated to 95°C for 2 minutes, and 4 of each sample were 
resolved by electrophoresis through a 20% denaturing acrylamide gel (19:1 cross-linked) 
5 with 7 M urea, in a buffer containing 45 mM Tris-Borate (pH83) and 1 A mM EDTA. 
The results were imaged using the Molecular Dynamics Fluoroimager 595, excitation 
488, emission 530. The resulting image is shown in Fig. 36B. 

In Fig. 36A, the left-hand panel shows the reactions that lacked an ARRESTOR 
ohgonucleotide, while the right hand panel shows the data from reactions that included 
10 the all natural ARRESTOR oligonucleotide. The first two lanes of each panel are from 
no-target controls, the second set of lanes contained target. The products of cleavage are 

O visible in the bottom one/fourth of each panel. The position at which the specific reaction 

P 

m products should run is indicated by arrows on left and right. 

Jl; It can be seen by examination of these data, that the reactions run in the absence 

Ml 

W 15 of ARRESTOR oligonucleotide show reproducible quality between the rephcates, and 
J. show significant cleavage only when target is present. In contrast, the addition of another 

^ unmodified oligonucleotide into the reactions causes great variation between the replicate 
I'l lanes (e,g., lanes 5 and 6 were provided with the same reactants, but produced markedly 

S different results). The introduction of the all-natural ARRESTOR ohgonucleotide 
20 produced, rather than reduced, background in these no-target lanes, and increased 

cleavage at other sites (z.e., the bands other that those indicated by the arrows flanking the 
panels). For these reasons the modifications that are described above, the effects of 
which are shown on Fig. 36B, were incorporated. 

The first 4 lanes of Fig. 36B show the products of duplicate reactions without an 
25 ARRESTOR ohgonucleotide, plus or minus the HBV target (lanes 1,2, and lanes 3, 4, 
respectively); The next 4 lanes, 5, 6 and 7, 8 used a natural ARRESTOR oligonucleotide 
having a 3' terminal amme; lanes 9, 10 and 1 1, 12 used the ARRESTOR oligonucleotide 
with a 3' portion composed of 2' 0-methyl nucleotides, and having a 3' terminal amine; 
lanes 13, 14 and 15, 16 used the ARRESTOR ohgonucleotide composed entirely of 2' O- 
30 methyl nucleotides and having a 3' terminal amine. The products of cleavage of the 
secondary probe are visible in the lower one third of each panel. 
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Visual inspection of these data shows that the addition of the 3' terminal amine to 
the natural ARRESTOR oligonucleotide suppresses the aberrmt cleavage seen in Fig. 
36A, but this ARRESTOR oUgonucleotide does not improve the performance of the 
reaction, as compared to the no-ARRESTOR oligonucleotide controls. In contrast, the 
use of the T 0-methyl nucleotides in the body of the ARRESTOR oligonucleotide does 
reduce background, whether partially or completely substituted. To quantify the relative 
effects of these modifications, the fluorescence from each of the co-migrating product 
bands was measured, the signals from the duplicate lanes were averaged and the "fold 
over background" was calculated for each reaction containing target nucleic acid. 

When ARRESTOR oligonucleotide was omitted, the target-specific signal (lanes 
3, 4) was 27-fold over the no target background; the natural ARRESTOR oligonucleotide 
+amine gave a signal of 17-fold over background; the partial 2' 0-methyl+ amine gave a 
signal of 47-fold over background; and the completely T 0-methyl+ amine gave a signal 
of 33 fold over background. 

These Figures show that both modifications can have a beneficial effect on the 
specificity of the multiple, sequential invasive cleavage assay. They also show that the 
use of the T 0-methyl substituted backbone, either partial or entire, markedly improves 
the specificity of these reactions. It is intended that, in various embodiments of the 
present inventon, any number of modifications that make either the ARRESTOR 
oUgonucleotide or the complex it forms with the primary target resistant to nucleases will 
provide similar enhancement. 

EXAMPLE 17 

Effect of ARRESTOR Oligonucleotide Length on Signal Enhancement in Multiple 

Sequential Invasive Cleavage Assays 

As noted in the Description of the Invention, the optimal length for an 
ARRESTOR oligonucleotide depends upon the design of the other nucleic acid elements 
of the INVADER reaction, particularly on the design of the primary probe. In this 
Example, the effects of varying the length of the ARRESTOR oUgonucleotide were 
explored in systems using two different secondary probes. A schematic diagram showing 
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these ARRESTOR oligonucleotides aligned as they would hybridize to the primary probe 
oligonucleotide is provided in Fig. 37C. In this Figure, the region of the primary probe 
that recognizes the target nucleic acid is shown underlined; the non-underlined portion, 
plus the first underlined base is the portion that is released by the first cleavage, and goes 

5 on to participate in the second or subsequent cleavage structure. 

All reactions were performed in duplicate. The INVADER reactions were done 
in a final volume of 10 ^il fmal volume containing 10 mM MOPS, pH 7.5, mM MgCh, 
500 finoles of primary INVADER 241-95-01, (SEQ ID NO:610), 25 pmoles of primary 
probe 241-95-02 (SEQ ID N0:611), 30 ng of AfiiFENl enzyme, and 10 ng of human 

10 genomic DNA, and if included, 1 amoles of HBV amplicon pAM 6 #2. MOPS, target 
DNA, and INVADER oligonucleotides were combined to a fmal volume of 7 |al. 
Samples were heat denatured at 95°C for 5 minutes, then cooled to 67°C. During the 5 

0 minute denaturation, MgCla, probe and enzyme were combined. The primary INVADER 

1 reactions were initiated by tiie addition of 3 \il of MgCl2, probe and enzyme mix, to the 
n 15 final concentrations indicated above. Reactions were incubated for 30 minutes at 67°C. 
^ The reaction were then cooled to 52°C, and each primary INVADER reaction received 
Q the following secondary reaction components in a total volume of 3 |j,1: 2.5 pmoles 

f'\ secondary target 241-95-07 (SEQ ID NO:612), 10 pmoles of either secondary probe 228- 

2 48-04 (SEQ ID NO:598), or 228-48-04N (SEQ ID NO:604) and 100 pmoles of an 

m 20 ARRESTOR oligonucleotide, either 241-95-03 (SEQ ID NO:613), 241-95-04 (SEQ ID 
NO:614X 241-95-05 (SEQ ID NO:615) or 241-95-06 (SEQ ID NO:616). The 
ARRESTOR oligonucleotides were omitted from some reactions as controls for 
ARRESTOR oligonucleotide effects. 

The reactions were incubated at 52°C for 34 minutes, and were then stopped by 
25 the addition of 10 ^il of 95% formamide, 10 mM EDTA, and 0.02% crystal violet. All 
samples were heated to 95''C for 1 minute, and 4 iitl of each sample were resolved by 
electrophoresis through 20% denaturing acrylamide gel (19:1 cross-linked) with 7 M 
urea, in a buffer containing 45 mM Tris-Borate (pH8.3) and 1.4 mM EDTA. The results 
were imaged using the Molecular Dynamics Fluoroimager 595, excitation 488, emission 
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530. The resulting images for the reactions with the shorter and longer secondary probes 
are shown in Figs. 37A and 37B, respectively. 

In each Figure, the products of cleavage are visible as bands in the bottom half of 
each lane. The first 4 lanes of each Figure show the products of duplicate reactions 
without an ARRESTOR oligonucleotide, plus or minus the HBV target (lanes sets 1 and 
2 respectively); in the next 4 lanes, sets 3 and 4 used the shortest ARRESTOR 241-95-03 
(SEQ ID NO:613); lanes 5 and 6 used 241-95-04 (SEQ ID NO:614); lanes 7 and 8 used 
241-95-05 (SEQ ID NO:615); and lanes 9 and 10 used 241-95-06 (SEQ ID NO:616). 

The principal background of concem is the band that appears in the "no target" 
control lanes (odd numbers; this band co-migrates with the target-specific signal near the 
bottom of each gel panel). Visual inspection shows that the shortest ARRESTOR 
oligonucleotide was the least effective at suppressing this background, and that the 
efficacy was increased when the ARRESTOR oligonucleotide extended further into the 
portion that participates in the subsequent cleavage reaction. Even with this difference in 
effect, it can be seen from these data that there is much latitude in the design of the 
ARRESTOR oUgonucleotide. The choice of lengths will be influenced by the 
temperature at which the reaction making use of the ARRESTOR oUgonucleotide is 
performed, the lengths of the duplexes formed between the primary probe and the target, 
the primary probe and the secondary target, and the relative concentrations of the 
different nucleic acid species in the reactions. 

EXAMPLE 18 

Effect of ARRESTOR Oligonucleotide Concentration on Signal Enhancement in 
Multiple Sequential Invasive Cleavage Assays 

In examining the effects of including ARRESTOR oligonucleotides in these 
cleavage reactions, it was of interest to determine if the concentration of the ARRESTOR 
oUgonucleotide in excess of the primary probe concentration would have an effect on 
yields of either non-specific or specific signal, and if the length of the ARRESTOR 
oUgonucleotide would be a factor. These two variables were investigated in the 
following Example. 
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All reactions were performed in duplicate. The primary INVADER reactions 
were done in a final volume of 10 |al and contained 10 mM MOPS, pH 7.5; 7.5 mM 
MgCl2, 500 finoles of primary INVADER 241-95-01 (SEQ ID NO:610), 25 pmoles of 
primary probe 241-95-02 (SEQ ID N0:61 1), 30 ng of AfuFENl enzyme, and 10 ng of 
5 human genomic DNA. Where included, the target DNA was 1 amole of HBV amphcon 
pAM 6 #2, as described above, MOPS, target and INVADER were combined to a final 
volume of 7 |il. The samples were heat denatured at 95°C for 5 minutes, then cooled to 
67°C. During the 5 minute denaturation, MgCl2, probe and enzyme were combined. The 
primary INVADER reactions were initiated by the addition of 3 |il of MgCli, probe and 
10 enzyme mix. The reactions were incubated for 30 minutes at 67°C. The reactions were 
then cooled to 52°C and each primary INVADER reaction received the following 
secondary reaction components: 2.5 pmoles secondary target 241-95-07 (SEQ ID 
NO:612), 10 pmoles secondary probe 228-48-04 (SEQ ID NO:598); and, if included, 50, 
100 or 200 pmoles of either ARRESTOR oligonucleotide 241-95-03 (SEQ ID NO:613) 
15 or 241-95-05 (SEQ ID NO:615), in a total volume of 3 ^il. Reactions were then 

incubated at 52''C for 35 minutes. Reactions were stopped by the addition of 10 \xl of 
95% formamide, 10 mM EDTA, and 0.02% crystal violet. All of the samples were 



O 
m 



p5 



|i| heated to 95*^C for 1 minute, and 4 \xl of each sample were resolved by electrophoresis 

m 

R| 20 containing 45 mM Tris-Borate (pH8.3), and 1 .4 mM EDTA. The results were imaged 



through 20% denaturing acrylamide gel (19:1 cross-linked) with 7 M urea, in a buffer 



using the Molecular Dynamics Fluoroimager 595, excitation 488, emission 530. The 
resulting images are shown as a composite image in Fig. 38. 

Each of the duplicate reactions were loaded on the gel in adjacent lanes and are 
labeled with a single lane number. All odd numbered lanes were no-target controls. 

25 Lanes 1 and 2 had no ARRESTOR oligonucleotide added; lanes 3-8 show results from 
reactions containing the shorter ARRESTOR oligonucleotide, 241-95-03 (SEQ ID 
NO:613); lanes 9-14 show results from reactions containing the longer ARRESTOR 
oligonucleotide, 241-95-05 (SEQ ID NO:615). The products of cleavage from the 
secondary reaction are visible in the bottom one third of each panel. Visual inspection of 

30 these data (i.e., comparison of the specific products to the background bands) shows that 
both ARRESTOR oHgonucleotides have some beneficial effect at all concentration. 
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To quantify the relative effects of ARRESTOR oligonucleotide length and 
concentration, the fluorescence from each of the co-migrating product bauds was 
measured, the signals from the duplicate lanes were averaged and the "fold over 
background" (signal+target/signal-target) was calculated for each reaction containing 
5 target nucleic acid. The reaction lacking an ARRESTOR oligonucleotide yielded a signal 
approximately 27-fold over background. Inclusion of the shorter ARRESTOR 
oligonucleotide at 50, 100 or 200 pmoles produced products at 42, 51 and 60-fold over 
background, respectively. This shows that while the short ARRESTOR at the lowest 
concentration seems to be less effective thaa the longer ARRESTOR oUgonucleotides 
10 (See, e.g., previous Example) this can be compensated for by increasing the concentration 
of ARRESTOR oUgonucleotide, and thereby the ARRESTOR oligonucleotiderprimary 
y probe ratio. 

np hi contrast, inclusion of the longer ARRESTOR oUgonucleotide at 50, 100 or 200 

t pmoles produced products at 60, 32 and 24 fold over background, respectively. At the 

w 

W 1 5 lowest concentration, the efficacy of this longer ARRESTOR oligonucleotide relative to 

. f\ 

J, ' the shorter ARRESTOR oUgonucleotide is consistent with the previous Example. 
2 Increasing the concentration, however, decreased the yield of specific product, suggesting 
ri a competition effect with some element of the secondary cleavage reaction. 

f5 These data show that the ARRESTOR oligonucleotides caa be used to advantage 

- ^ 20 in a number of specific reaction designs. The choice of concentration will be influenced 
by the temperature at which the reaction making use of the ARRESTOR oligonucleotide 
is performed, the lengths of the duplexes formed between the primary probe and the 
target, the primary probe and the secondary target, and between the primary probe and 
the ARRESTOR oligonucleotide. 
25 Selection of oUgonucleotides for target nucleic acids other than the HBV shown 

here, (e.g., oligonucleotide composition and length), and the optimization of cleavage 
reaction conditions in accord with the models provided here follow routine methods and 
common practice well known to those skilled in the methods of molecular biology. 
Example 10 demonstrated that some enzymes require an overlap between an 
30 upstream INVADER oUgonucleotide and a downstream probe oUgonucleotide to create a 
cleavage structure (Figure 27). It has also been determined that the 3* terminal nucleotide 
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of the INVADER oligonucleotide need not be complementary to the target strand, even if 
it is the only overlapping base in the INVADER oligonucleotide (e.g., as with the HCMV 
probes shown in Figure 30). The requirement for an overlap can serve as a convenient 
basis for detecting single base polymorphisms (SNPs) or mutations in a nucleic acid 
sample. 

For detection of single base variations, at least two oligonucleotides (e.g., 
a probe and an INVADER oHgonucleotide) hybridize in tandem to the target nucleic acid 
to fomi the overlapping structure recognized by the CLEAVASE enzyme to be used in 
the reaction. An impaired "flap" is included on the 5' end of the probe. The enzyme 
recognizes the overlap and cleaves off the unpaired flap, releasing it as a target-specific 
product. Enzymes that have a strong preference for an overlapping structure, i.e., that 
cleave the overlapping structure at a much greater rate than they cleave a non- 
overlapping structure include the FEN-1 enzymes from Archaeoglobus fulgidus and 
Pyrococcus furiosus and such enzymes are particularly preferred in the detection of 
mutations and SNPs. . 

EXAMPLE 19 

Kits for performing the mRNA INVADER Assay 

In some embodiments, the present invention provides kits comprising one or more 
of the components necessary for practicing the present invention. For example, the 
present invention provides kits for storing or dehvering the enzymes of the present 
invention and/or the reaction components necessary to practice a cleavage assay (e.g., the 
INVADER assay). By way of example, and not intending to limit the kits of the present 
invention to any particular configuration or combination of components, the following 
section describes one embodiment of a kit for practicing the present invention: 

In some embodiments, the kits of the present invention provide the following reagents: 

CLEAVASE enzyme (e.g.. Primary Oligos 

TthAKK) 



297 



RNA Primary Buffer 1 
RNA Secondary Buffer 1 
tRNA Carrier [20 ng/|il] 
Tioeo.i Buffer [lOmM Tris-HCl, 
pH 8, O.lmMEDTA] 



Secondary Oligos 

RNA Standard [100amol/|Lil] 

lOX Cell Lysis Buffer 1 



Examples of Primary Oligonucleotides and Secondary Oligonucleotides suitable 
for use with the methods of the present invention are provided in Figure 41 . While the 
oligonucleotides shown therein may fmd use in a number of the methods, and variations 
of the methods, of the present invention, these INVADER assay oligonucleotide sets fmd 
particular use with kits of the present invention. The oligonucleotide sets shown in 
Figure 41 may be used as individual sets to detect individual target RNAs, or may be 
comined in biplex or multiplex reactions for the detection of two or more analytes or 
controls in a single reaction. It is contemplated that the designs of these probes sets (e.g., 
the oligonucleotides and/or their sequences) may be adapted for use in DNA detection 
assays, using the guidelines for reaction design and optimization provided herein. 
Additional oUgonucleotides that find use in detection assays and kits of the present 
invention are provided in Figure 47. 

In some embodiments, a kit of the present invention provides a list of additional 
components (e,g,, reagents, supplies, and/or equipment) to be supplied by a user in order 
to perform the methods of the invention. For example, and without intending to limit 
such additional components Usts to any particular components, one embodiment of such a 
Ust comprises the following: 

■ RNase-free (e.g. , DEPC-treated) H2O 

■ Clear CHILLOUT-14 Uquid wax (MJ Research) or RNase-free, optical grade mineral 
oil (Sigma, Cat. No. M-5904) 

■ Phosphate-buffered saline (no MgCh, no CaCl2) 

■ 96-well polypropylene microplate (MJ Research, Cat. No. MSP-9601) 

■ 0.2-ml thin-wall tubes 

■ Thermaseal well tape (e.g., GeneMate, Cat. No. T-2417-5) 

■ Multichannel pipets (0.5-10^1, 2.5-20|il, 20-200|al) 
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Thermal cycler or other heat source {e,g,, lab oven or heating block). 

Fluorescence microplate reader (a preferred plate reader is top-reading, equipped with 

light filters have the following characteristics: 



Excitation 



Emission 



(Wavelength/Bandwidth) 



(Wavelength/Bandwidth) 



485 nm/20 nm 



530nm/25nni 



560 nm/20 nm 



620 nni/40 nm 



In some embodiments, a kit of the present invention provides a list of optional 
components (e.g., reagents, supplies, and/or equipment) to be suppUed by a user to 
facilitate performance of the methods of the invention. For example, and without 
intending to limit such optional components Usts to any particular components, one 
embodiment of such a list comprises the following: 

■ tRNA Solution, 20ng/^l (Sigma, R-5636) 

■ IX Stop Solution (lOmM Tris-HCl, pH 8, lOmM EDTA) 

■ Black opaque, 96-well microplate (e.g., COSTAR, Cat No. 3915) 

■ Electronic repeat pipet (250ili1) 

In some embodiments of a kit, detailed protocols are provided. In preferred 
embodiments, protocols for the assembly of INVADER assay reactions {e.g., 
formulations and preferred procedures for making reaction mixtures) are provided. In 
particularly preferred embodiments, protocols for assembly of reaction mixtures include 
computational or graphical aids to reduce risk of error in the performance of the methods 
of the present invention (e.g., tables to facilitate calculation of volumes of reagents 
needed for multiple reactions, and plate-layout guides to assist in configuring multi-well 
assay plates to contain numerous assay reactions). By way of example, and without 
intending to limit such protocols to any particular content or format, kits of the present 
invention may comprise the following protocol: 
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DETAILED mRNA INVADER ASSAY PROTOCOL 



Plan the microplate layout for each experimental run. An example microplate layout 
for 40 samples, 6 standards, and a No Target Control is shown in Fig, 40. Inclusion of 
a No Target Control (tRNA Carrier or IX Cell Lysis Buffer 1) and quantitation 
standards are required for absolute quantitation. 

Prepare the Primary Reaction Mix for either the single or biplex assay format. To 
calculate the volumes of reaction components needed for the assay (X Volume), 
multiply the number of reactions (for both samples and controls) by 1 .25 [X Volume 
(^il) = # reactions x 1,25]. Vortex the Primary Reaction Mix briefly after the last 
reagent addition to mix thoroughly. Aliquot 5[i\ of the Primary Reaction Mix per 
microplate well (an electronic repeat pipet is recommended for this step). 

Primary Reaction Mix 



Single Assay Format 



Reaction Components 


IX Volume 


X Volume 


RNA Primary Buffer 1 


4.0^il 




Primary Oligos 


0.25^1 




Tioeo.i Buffer 


0.25^1 




CLEAVASE enz. enzyme 


0.5^1 




Total Mix Volume (IX) 


5.0^,1 




Biplex Assay Format 


Reaction Components 


IX Volume 


X Volume 


RNA Primary Buffer 1 


4.0^1 




Primary Oligos 


0.25^1 




Housekeeping Primary Oligos 


0.25^il 




CLEAVASE enzyme 


0.5^1 




Total Mix Volume (IX) 


5.0^1 





Add 5|j.l of each No Target Control, standard, or sample (total RNA or cell lysate) to 
the appropriate well and mix by pipetting up and down 1-2 times. Overlay each 
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reaction with 10[xl of clear CHILLOUT or mineral oil. Seal microplate with 
Thermaseal well tape. 

Incubate reactions for 90 minutes at GO^'C in a thermal cycler or oven. 
While the primary reaction is incubating, prepare the Secondary FRET Reaction Mix 
for the single or biplex format. Calculate the component volumes required (X 
Volume) by multiplying the number of reactions (for both samples and controls) by 
1.25 [X Volume (|iil) = # reactions x 1.25 (id)]. Aliquot the Secondary FRET 
Reaction Mix into multiple 0.2-ml thin-wall tubes or an 8-well strip (70|j,l/tube is 
sufficient for a row of 12 reactions). 

Secondary FRET Reaction Mix 



Single Assay Format 



Reaction Components 


IX Volume 


X Volume 


RNA Secondary Buffer 1 


2.0^il 




Secondary Oligos 


1.5^il 




Tioeo.i Buffer 


l.5^l 




Total Mix Volume (IX) 


5.0^1 




Biplex Assay Format 


Reaction Components 


IX Volume 


X Volume 


RNA Secondary Buffer 1 


2.0jal 




Secondary Oligos 


1.5)^1 




Housekeeping Secondary 


L5\i\ 




Total Mix Volume (IX) 


5.0^1 





After the primary reaction incubation is completed, remove the microplate seal, and 
add 5 10,1 Secondary FRET Reaction Mix per well using a multichannel pipet. Mix by 
pipetting up and down 1-2 times. Reseal the microplate with the well tape and 
incubate the microplate at 60°C for 60 or 90 minutes, as indicated in each Product 
hiformation Sheet. The secondary reaction incubation time can be varied. See 
sections 2 of the PROCEDURAL NOTES FOR OPERATION OF THE mRNA 
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INVADER ASSAY for details. 



Reactions can be read using one of two procedures: Direct Read or Stop and 
Transfer . 

NOTE: Remove the microplate seal before reading the microplate. 
Direct Read Procedure 

■ This procedure enables collection of multiple data sets to extend the assay's 
dynamic range. During the secondary INVADER reaction, read the microplate 
directly in a top-reading fluorescence microplate reader. 

Recommended settings for a PerSeptive Biosystem Cytofluor 4000 instrument are as 
follows: 

Specific Gene Signal: Housekeeping Gene Signal: 



Excitation: 485/20nm Excitation: 560/20nm 

Emission: 530/25nm Emission: 620/40nm 

ReadsAVell: 10 ReadsAVell: 10 

Gain: 40 Gain: 45 

Temperature: 25°C Temperature: 25*^0 



NOTE: Because the optimal gain setting can vary between instruments, adjust 
the gain as needed to give the best signal/background ratio (sample raw signal 
divided by the No Target Control signal) or No Target Control sample readings of 
-100 RFUs. Fluorescence microplate readers that use a xenon lamp source 
generally produce higher RFUs. For directly reading the microplates, the probe 
height of, and how the plate is positioned in, the fluorescence microplate reader 
may need to be adjusted according to the manufacturer's recommendations. 

Stop and Transfer Procedure 

1. Prepare IX Stop Solution (lOmM Tris-HCl, pH 8, lOmM EDTA) with 
RNase-free H2O. Add 100|al per well with a multichannel pipet. 

2. Transfer lOOjal of the diluted reactions to a black microplate (e.g., 
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COSTAR (Coming), Cat. No. 3915). 

3. Read the microplate using the same parameters as the Direct Read 

Procedure, but adjust the gain to give No Target Control sample readings of -100 
RFUs (see NOTE above). 

In some embodiments, supplemenatary documentation, such as protocols for 
ancillary procedures, e.g., for the preparation of additional reagents, or for preparation of 
samples for use in the methods of the present invention, are provided. In preferred 
embodiments, supplementary documentation includes guidelines and lists of precautions 
provided to facilitate successful use of the methods and kits by unskilled or inexperienced 
users. In particularly preferred embodiments, supplementary documentation includes a 
troubleshooting guide, e.g., a guide describing possible problems that may be 
encountered by users, and providing suggested solutions or corrections to intended to aid 
the user in resolving or avoiding such problems. 

For example, and without intending to limit such supplementary documentation to 
any particular content, kits of the present invention may comprise any of the following 
procedures and guidelines: 

II. AVOIDANCE OF RNase CONTAMINATION 

To avoid RNase contamination during sample preparation and testing, in one 
embodiment, the user is cautioned to observe the following precautions: 

• Wear disposable gloves at all times to avoid contact with samples and reagents. 

• Use certified RNase-fi*ee disposables, including thin-wall polypropylene tubes and 
aerosol-barrier pipet tips, for preparing samples and assay reagents, to avoid cross- 
contamination. 

• Use RNase-firee (DEPC-treated) H2O for diluting samples and/or reagents. 

• Keep RNA samples and controls on ice during assay setup. 
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in. SAMPLE AND CONTROL PREPARATION 

NOTE: Dilute both standards and samples to concentrations that correspond to a 
5-|al addition per reaction. 

Example 1: The concentration of a S-attomole standard is lamol/nl. lamol = 10" 

mole = 602,000 molecules. 
Example 2: The concentration of a 100-ng sample should be 20ng/|j,l. 

A. Control Preparation 

No Target Control : 

Total RNA Format: tRNA Carrier (20ng/^l) 

Cell Lysate Format: IX Cell Lysis Buffer 1 (dilute lOX Cell Lysis Buffer 1 to IX 
with RNase-free H2O) 
Positive Control : RNA Standard (Std) (lOOamol/^il in vitro transcript) 

1 . Prepare RNA standards by diluting the positive controls with tRNA Carrier (when 
running total RNA samples) or with IX Cell Lysis Buffer 1 [lOX Cell Lysis Buffer 
1 diluted with RNase-free H2O] (when running cell lysate samples). The Product 
Information Sheet included in each kit indicates the recommended standard test 
levels and preparation methods. 

2. Using a fresh set of standards for each run is recommended. Store the standards on 
ice during reaction setup. 

B. Total RNA Sample Preparation 

1 . Prepare total RNA from cells or tissue according to manufacturer's instructions for 
the selected preparation method. Recommended methods include TRIZOL (Life 
Technologies, Rockville, MD), RNEASY (Qiagen, Valencia, CA), and RNA WIZ 
(Ambion, Austin, TX). 
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2. Dilute total RNA samples with RNase-free H2O to the appropriate concentration. 



C. Cell Lysate Sample Preparation -- 96-well microplate format 

NOTE: This cell lysate detection format is used for adherent cells cultured in 96-well 

tissue culture microplates. Cells are typically seeded at 10,000-40,000 cells per well. 

Different seeding densities maybe required depending on cell type and/or mRNA 

expression levels. See Procedural Notes for more details. For cells exhibiting high 

expression, the following methods can be used to attenuate the signal from the cell 

lysates: 

■ plate fewer cells per well; 

■ dilute the cell lysates with IX Cell Lysis Buffer 1 before addition to the 
reaction (e.g., 2.5|li1 lysate + 2.5ili1 IX Cell Lysis Buffer 1); 

■ read the reaction microplate 1 5-30 minutes after addition of the 
Secondary FRET Reaction Mix instead of the recommended 60-90 
minutes; 

L Dilute lOX Cell Lysis Buffer 1 to a IX concentration with RNase-free H2O. 

2. Using a multichannel pipet, carefully remove the culture medium from the wells of 
adherent cells without disturbing the cell monolayer. 

3. Wash the cells once with 200ial PBS (no MgCb, no CaCl2) and carefiiUy remove the 
residual PBS with the multichannel pipet. 

4. Add 40|al IX Cell Lysis Buffer 1 per well. Lyse cells at room temperature for 3-5 
minutes. 

5. Using a multichannel pipet, careftilly transfer 25|il of each lysate sample into a 96- 
well microplate. Avoid transferring cellular material from the bottom of the well, 

6. Overlay each lysate sample with 10|il clear CHILLOUT or mineral oil (overlaying is 
not necessary if using a heated-hd thermal cycler). 

7. Seal microplate with Thermaseal well tape. Immediately heat lysates at 75-80°C for 
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15 minutes in a thermal cycler or oven to inactivate cellular nucleases. 

8. During the heating step, proceed with the reaction setup. See DETAILED mRNA 
INVADER ASSAY PROTOCOL (above) for instructions. 

9. After the heat inactivation step, add the lysate samples immediately to the reaction 
microplate. Alternatively, the lysate samples can be quickly transferred to a -70°C 
freezer for later testing (long-term stability has not been estabUshed and may differ 
for each cell type). 

IV. PROCEDURAL NOTES FOR OPERATION OF THE mRNA INVADER 
ASSAY 

1. RNA sample types and optimization of RNA sample amount. 

The assay is optimized for performance with total RNA samples prepared from either 
tissue or cells. Several total RNA preparation methods/kits have been validated 
for performance in the mRNA INVADER assay: 

• TRIZOL (Life Technologies, Rockville, MD) 

• RNeasy (Qiagen, Valencia, CA) 

• RNA WIZ (Ambion, Austin, TX) 

It is important to use a method or kit that minimizes the level of genomic DNA, which 
can inhibit signal generation. Performance of a preliminary experiment is recommended 
to determine the amount of total RNA sample (typically l-200ng, depending on the 
gene's expression level) that provides the best limit of detection and dynamic range. 

The assay has also been validated with lysate samples from a number of cell types. 
Recommended cell densities in a 96-well tissue culture microplate are 10,000-40,000 
cells per well depending on cell type and expression level of the gene of interest. 
Performance of a preliminary experiment is recommended for any given cell line and/or 
gene being monitored. Such an experiment should include different cell density levels 
and/or dilution of the lysate samples with IX Cell Lysis Buffer 1 (e.g. a IjllI test level is 
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prepared by mixing lysate sample + 4|al IX Cell Lysis Buffer 1 for a 5^1 sample 
addition). 

2. Dynamic range modulation: variable secondary reaction incubation times* 

The length of the secondary reaction incubation time listed in the protocol is 
sufficient for most analytes. However, the linear detection range (Signal/Background < 
15-25) can be adjusted by reading the reaction microplate at variable times after addition 
of the secondary FRET reagents. For example, high expression samples can often be 
detected in 15-30 minutes. The Direct Read method (DETAILED mRNA INVADER 
ASSAY PROTOCOL, step 7) enables simple optimization of the secondary reaction time 
as the reaction microplate can be incubated further if an early time read does not provide 
enough signal from the samples being tested. 

Monitoring the secondary reaction fluorescence signal with time can also extend 
the dynamic range of the assay. The Direct Read method at multiple time points can be 
applied using low-cost instrumentation. Alternatively, real-time fluorescence 
instrumentation can be used to achieve comparable dynamic ranges exhibited by other 
mRNA quantitation methods. 

3. Dynamic range modulation: variable sample levels. 

While the FRET detection method greafly simphfies the assay, the dynamic range is 
typically Umited to 2-3 logs when using an endpoint read method. However, since 
mRNA INVADER assay signal is generated linearly with both target level and time, the 
easiest method for extending the dynamic range beyond 3 logs (as may be required, e.g., 
for highly induced genes) is to adjust total RNA sample levels. Fold changes in gene 
expression (treated sample signal divided by untreated sample signal) can be reliably 
calculated using normalized sample signals. This is accomplished by testing sample 
levels that give signal within the linear detection range defined by the standard curve. 
For example, the fold induction for a highly induced sample can be calculated as follows: 
Fold induction = (Net Signal for 1 ng treated sample X 100) / Net Signal for 100 ng 
untreated sample 
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V. TROUBLESHOOTING GUIDE 



Problem 


Possible Solution 


No signal 


• Check that the fluorescence microplate reader has been set up 
correctly and that the appropriate excitation and emission filters are 
in place. 

« pprfnrm mRMA TlsJV ADFl^ fl<5<iflv with the nrovided standard as a 
positive control. 

• Potential RNase contamination of the samples and reagents. 
Discard suspect reagents. 

• Use only reagents and oligonucleotides supplied in the kit. Do not 
mix reagents or oligonucleotides between kits. 


High variation 

between 

replicates 


• Always work with master primary and secondary reaction mixes. 

• Thoroughly mix all master mixes and samples. 

• Pipet in a similar manner across all the controls and samples. 

• Calibrate pipets frequently. 


Lack of low 
target level 
detection 


• Calibrate thermal cycler or heat block. 

• Minimize assay variability (see above), i.e. CVs are less than 5% 
for the sample replicates. This is particularly important for 
detecting low target levels. 


Lack of 
discrimination 
between hish 
signal samples 


• Decrease secondary reaction incubation time to achieve detection 
within the linear range of the assay. 

• Use less total RNA per reaction. 

• Attenuate cell lysate sample signal (see NOTE, Sample and Control 
Preparation, Part C). 


Signal 
inhibition 


• Run samples on an agarose gel to check for presence of genomic 
DNA. Alter the RNA sample isolation method to minimize 
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genomic DNA or presence of other inhibitors. The same isolation 




procedure should be used throughout an experiment. 




• If using the cell lysate format, residual PBS can be inhibitory. Be 




sure to remove residual PBS from the tissue culture microplate. Do 




not use PBS that contains MgCli or CaCb , which inhibits the 




assay. 
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APPENDIX A: 



mRNA INVADER SINGLE ASSAY WORKSHEET 
mRNA INVADER ASSAY PROCEDURE 

■ Prepare samples and controls. 

■ Prepare Primary Reaction Mix, Vortex briefly and aliquot 5 )il per well. 

■ Add 5 [0,1 sample or control per well and pipet up and down 1-2 times. 

■ Add 10|Lil CHILLOUT or mineral oil per well. 

■ Incubate primary reaction at 60°C for 90 minutes. 

■ Prepare Secondary FRET Reaction Mix, vortex briefly. 

■ Using a multichannel pipet, aliquot 5\i\ per well and pipet up and down 1-2 times. 

■ Incubate secondary reaction at 60°C for 60 or 90 minutes. 

■ Read microplate in fluorescence microplate reader (FAM Dye: Ex. 485 nm/Em. 
530 nm). 



PRIMARY REACTION MIX 



Reaction Components 


IX Volume 


X Volume (No. of 
reactions x 1.25'> 


RNA Primary Buffer 1 


4.0|al 




Primary Oligos 


0.25|al 




Tioeo.i Buffer 


0.25^1 




CLEAVASEenzyme 


0.5|al 




Total Mix Volume (IX) 


5.0|al 




SECONDARY FRET REACTION MIX 


Reaction Components 


IX Volume 


X Volume 
fNo. of reactions x 1.2S\ 


RNA Secondary Buffer 1 


2.01^1 




Secondary Oligos 


1.5^1 




Tioeo.i Buffer 


1.5^1 




Total Mix Volume (IX) 


5.0(xl 
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APPENDIX B: 



mRNA INVADER BIPLEX ASSAY WORKSHEET 

mRNA INVADER ASSAY PROCEDURE 

■ Prepare samples and controls. 

■ Prepare Primary Reaction Mix. Vortex briefly and aliquot 5)il per well 

■ Add 5|li1 sample or control per well and pipet up and down 1-2 times. 

■ Add 1 0|il CHILLOUT or mineral oil per well. 

■ Incubate primary reaction at 60°C for 90 minutes. 

■ Prepare Secondary FRET Reaction Mix, vortex briefly. 

■ Using a multichannel pipet, aliquot 5|li1 per well and pipet up and down 1-2 times. 

■ Incubate secondary reaction at 60°C for 60 or 90 minutes. 

■ Read microplate in fluorescence microplate reader (FAM Dye: Ex. 485 nm/Em. 
530 nm and red dye: Ex. 560nm/Em. 620nm). 



PRIMARY REACTION MIX 



Reaction Components 


IX Volume 


X Volume 
(# reactions x 1.25") 


RNA Primary Buffer 1 


4.0^il 




Primary Oligos 


0.25 1^1 




Housekeeping Primary Oligos 


0.25^1 




CLEAVASE IX enzyme 


0.51x1 




Total Mix Volume (IX) 


5.o^ll 




SECONDARY FRET REACTION MIX 


Reaction Components 


IX Volume 


X Volume 
(# reactions x LlSt 


RNA Secondary Buffer 1 


2.01x1 




Secondary Oligos 


1.5|xl 




Housekeeping Secondary Oligos 


1.5^1 




Total Mix Volume (IX) 


S.Ojxl 
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EXAMPLE 20 



Additional RNA INVADER Assay Embodiments 

1 . ARRESTOR oligonucleotides in single-step INVADER assays. 

ARRESTOR oligonucleotides can be used to selectively stop signal accumulation 
for particular target nucleic acids in a reaction, such that signal can continue to 
accumulate for other target nucleic acids in the same reaction mixture. The following is a 
description of one embodiment in which it is desirable to selectively arrest a particular 
reaction within a mixture. 

The 18S ribosomal RNA is usefol as an intemal standard in RNA quantitation 
assays, e.g., for use in normalizing the signal generated from a specific gene or RNA of 
interest in the same sample. As an intemal control, it is preferred that the signal should 
report both sample variability (e.g., dead cells in cell lysate samples, cell plating errors, 
RNA degradation in a particular total RNA sample) and assay variability (e.g., pipetting 
errors, uneven heating across the plate). Because 18S rRNA is present in samples such as 
cell lysates in relatively high concentrations compared, e.g., to most mRNAs, the 18S can 
often be detected in a single- step INVADER assay, without the need for a secondary or 
subsequent reaction. One consideration of running a single-step INVADER assay 
reaction in the same mixture as a multi-step INVADER assay reaction is that the multi- 
step reactions may involve changes of conditions (e.g., a solution addition or a 
temperature shift) between a primary reaction and subsequent reaction steps. For 
example, in the mRNA detection assays described herein, the primary reaction conditions 
are optimal for RNA detection, and the secondary reaction conditions are optimal for 
DNA detection. Accumulating single-step assay signal under two different conditions 
may complicate quantitation. Further, because of the high concentration of the 18S RNA, 
too much signal may sccumulate from this target. The reaction components may be 
exhausted, such that the signal reaches a plateau that is not an accurate reflection of the 
18S concentration, or the accumulated signal may overwhelm the signal from other 
nucleic acids in the reaction. In some embodiments, it is preferable to limit the signal 
from the 18S rRNA. For example, a primary reaction with may be initiated with probe 
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sets for the 1 8S rRNA and for one or more other targets. An ARRESTOR 
oligonucleotide to stop the 18S detection reaction may be added along with the secondary 
reaction reagents for the other target. 

2. Reactions without Arrestor oUgonucleotides. 

In some embodiments, this format involves running a high-temperature primary reaction 
(e.g. 65C) and a low-temperature secondary reaction (e.g. 55C) (e.g.. Figure 50A). All 
INVADER reaction reagents are combined at the same time, and temperature is used to 
segregate the primly and secondary reactions. In this embodiment, FRET background 
does not form in the primary reaction because the FRET oUgonucleotide cannot stably 
bind to the secondary reaction template (SRT). In one embodiment of this format, the 
reaction components and conditions are as follows: 

Reaction components 

5- lOpmoles Probe 
5 pmoles INVADER oHgo 
3 pmoles Stacking oligo 
10 pmoles FRET oligo 

6- 12 pmoles SRT (1.2-fold excess over Probe level) 
standard RNA reaction buffer 
standard MgS04 level 

20 ng CLEAVASE enzyme (ag., CLEAVASE VII, IX). 

In preferred embodiments, primary reaction components are mixed and aliquoted 

25 into the reaction plate (e.g. 5 jil per well). 

The RNA sample is added (e.g. in a 5 |li1 aliquot). 

The reaction plate is incubated for 90 minutes (can be shorter or longer) at 65C 
for the primary reaction. 

The reaction plate is incubated for 15-90 minutes (depending on the expected 
30 expression level and assay sensitivity) at 55C for the secondary reaction. 

The plate (96-well or 384-well) is measured in a fluorescence plate reader. 
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In another embodiment, a Mini-Arrestor is provided (Figure SOB) This format is 
run similarly to the No Arrestor format except that a Mini-Arrestor oligonucleotide is 
included as well. In some embodiments, Mini-Arrestor oligonucleotides may be 
5 synthesized with 2'-0Me nucleotides. When the temperature is reduced to start the 

secondary reaction, the remaining uncleaved Probe binds to the Mini-Arrestor instead of 
the SRT. Consequently, a FRET oligonucleotide in the secondary reaction binds only to 
the specific secondary substrate (comprising, e.g., cleaved 5' arm stably bound to an 
SRT). 

10 

TABLE 2 



15 



Klenow 


Kcat Cs" 


Km(dNTP^ 


KdCnM^ 


Relative DNA 


Reference 


TaaPol 




h 






affinity 






Wild-Type 


2.4 


2.8 


8 


1 


2 


Wild-Type 


S610A 


n. d. 




n.d. 




5 


S515 


R668A 


0.006 


6.5 


140, 150 


0.06, 0.05 


1,2 


R573* 


N678A 


n. d. 




n.d. 




5 


N583 


E710A 


0.1 


15 


250 


0.03 


2 


E615* 


E710D 


1.7 


7.7 


110 


0.07 


2 


E615* 


K758A 


0.131 


15.6 




0.63 


4 


K663 


K758R 


2.0 


2.1 




1.125 


4 


K663 


Y766S 


0.8 


6.4 


13 


0.4, 0.6 


1,2 


Y671 


R841A 


0.3 


9.8 


40,53 


0.2 


1,2 


R746* 


N845A 


1.0 


23 


8,5 


1.0, 1.7 


1,2 


N750* 


N845Q 


0.03 


1.7 


80, 55 


0.1, 0.2 


1,2 


N750* 


Q849A 


0.02 


3.8 


100, 160 


0.08, 0.05 


1,2 


Q754 
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Q849E 


0.001 


n. d. 


90,91 


0.09 


1,2 


Q754 


H881A 


0.3 


3.3 


20, 28 


0.4, 0.3 


1,2 


H784* 


D882N 


<0.0001 


n. d. 


30 


0.6 


2 


D785 


D882S 


0.001 


7.5 


0.9 


9 


2 


D785 



References: 

1. JBC (1990) 265:14579-14591 

2. JBC (1992) 267:8417-8428 

5 3. Eur. J. Biochem (1993) 214:59-65 

4. JBC (1994) 269:13259-13265 

5. Nature (1996) 382:278-281 



10 TABLE 3: Rational mutations in the polymerase region 
A. DNA activity table 



IdT %Tth %Taq4M HP X 



TthDNRXHT 


31.91 


100% 


83% 


3.81 


101.9 


TthDNRXHT 
H641A 


23.61 


74% 


62% 


5.32 


221.24 


TthDNRXHT 
R748A 


22.1 


69% 


58% 


4.39 


88.17 


Tth DN RX HT 
H786A 


34.31 


108% 


90% 


7.75 


185.35 


TthDNRXHT 
H786A/G506K/ 
Q509K(AKK) 


32.1 


101% 


84% 


5.7 


332.8 


TaqDNRXHT 

W417L/G418K/ 
E507Q/H784A 


38.23 


120% 


100% 


68.21 


1100.18 
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(Taq 4M) 












Taq 4M G504K 


36.04 


113% 


94% 


31.76 


417.40 


Taq4MH639A 


42.95 


135% 


112% 


91.46 


2249.67 


Taq 4M R587A 


44.78 


140% 


117% 


143.0 


252.69 


TaqDNRXHT 

W417L/G418K/ 

G499R/A502K/ 

I503L/K504N/ 

E07K/H784A 

(TaqSM) 


43.95 


138% 


115% 


122.53 


346.56 


TaqSS R677A 


32.3 


101% 


84% 


206.9 


2450.0 



B. RNA activity table 

IrTl %Tth %Taq4M 



TthDNRXHT 


0.89 


100% 


34% 


TthDNRXHT 
H641A 


1.18 


133% 


45% 


TthDNRXHT 
R748A 


1.34 


151% 


51% 


TthDNRXHT 
H786A 


1.31 


147% 


49% 


TthDNRXHT 
H786A/G506K/ 
Q509K (AKK) 


1.59 


179% 


60% 


TaqDNRXHT 
W417UG418K/ 
E507Q/H784A 
(Taq 4M) 


2.65 


298% 


100% 


Taq 4M G504K 


2.76 


310% 


114% 


Taq4MH639A 


3.89 


437% 


147% 
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Taq4MR587A 


3.13 


352% 


118% 


TaqDNRXHT 


4.00 


450% 


151% 


W417L/G418K/ 








G499R/A502K/ 








I503L/K504N/ 








E07K/H784A 








(TaqSM) 








TaqSS R677A 


2.22 


249% 


84% 



TABLE 4: Rational arch mutations 
DNA activity table 





IdT 


%Tth 


%Taq4M 


HP 


X 


Taq4M 
P88E/P90E 


10.20 


32% 


27% 


2.00 


97.00 


Taq4M 
G80E 


26.30 


82% 


69% 


103.6 


2900 


Taq4M 
L109F/A110T 


36.45 


114% 


95% 


19.71 


749.69 


RNA activity table 


IrTl 


%Tth 


%Taq4M 







Taq4M 


Taq4M 


0.10 


11% 


4% 


P88E/P90E 


P88E/P90E 








Taq4M 


Taq4M 


3.11 


349% 


117% 


G80E 


G80E 








Taq4M 


Taq4M 


2.45 


275% 


92% 


L109F/A110T 


L109F/A110T 
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TABLE 5: Arch/thumb combinations 
DNA activity table 





IdT 


%Tth 


%Taq4M 


HP 


X 


TaqW417L/ 


63.33 


198% 


166% 


177.05 


202.32 


G418K/E507K/ 












H784A/L109F/ 












A110T/G499R/ 












A502K/I503L/ 












G504K/E507K/ 












T514S (TaqSS) 












Taq P88E/P90E/ 
W417L/G418K/ 
G499R/A502K/ 
I503L/G504K/ 
E507K/T514S/ 
1 H784A 


36.48 


114% 


95% 


9.44 


70.35 



RNA activity table 





IrTl 


%Tth 


%Taq4M 


Taq W417L/ 


3.16 


355% 


119% 


G418K/E507K/ 








H784A/L109F/ 








A110T/G499R/ 








A502K/I503I7 








G504K/E507K/ 








T514S (TaqSS) 
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TaqP88E/P90E/ 


0.22 


25% 


8% 


W417L/G418K/ 








G499R/A502K/ 








I503L/G504K/ 








E507K/T514S/ 








H784A 









TABLE 6: Helix-hairpin-helix random mutagenesis 
DNA activity table 





IdT 


%Tth %Taq4M 


HP 


X 


TaqSS 
K198N 


23.4 


73% 


61% 


25.7 


1233.1 


TaqSS 
A205Q 


25.6 


80% 


67% 


13.4 


699.1 


TaqSS 
T204P 


11.2 


35% 


29% 


1.9 


209.4 


TaqSS 

I200M/A205G 


16.8 


53% 


44% 


7.8 


597.2 


TaqSS 
K203N 


25.9 


81% 


68% 


36.6 


1429.8 


TthDNRXHT 
H786A/P197R/K200R 


10.7 


33% 


28% 


3.2 


66.3 


TthDNRXHT 


11.5 


36% 


30% 


6.1 


327.5 


H786A/K205Y 












Tth DN RX HT 
H786A/G203R 


18.3 


57% 


48% 


2.1 


98.8 
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RNA activity table 

IrTl %Tth %Taq4M 



TaqSS 
K198N 


1.22 


137% 


46% 


TaqSS 
A205Q 


0.62 


70% 


23% 


TaqSS 
T204P 


0.36 


40% 


14% 


TaqSS 

I200M/A205G 


0.77 


87% 


29% 


TaqSS 
K203N 


2.09 


235% 


79% 


TthDNRXHT 
H786A/P197R/K200R 


0.47 


52% 


18% 


TthDNRXHT 


0.68 


77% 


26% 


H786A/K205Y 








TthDNRXHT 
H786A/G203R 


1.61 


180% 


61% 



10 TABLE 7: Random thumb mutations 
DNA activity table 





IdT 


%Tth 


%Taq4M 


HP 


X 


TaqDNRXHT 


59.96 


188% 


157% 


133.65 


907.41 


W417L/G418K/ 
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E507K/H784A/G499R/ 
A502K/K504N/ (Ml -13) 












TaqDNRXHT 

W417L/G418K/ 

/H784A/L500I/Q507H 

A502K/G504K(Ml-36) 


46.74 


146% 


122% 


123.11 


822.61 


TaqDNRXHT 
W417L/G418K/G499R/ 
A502K/G504K/E507K/ 
H784AAr514S (M2-24) 


85.7 


269% 


224% 


369.96 


3752.12 


TaqDNRXHT 
W417L/G418K/G499R/ 
A502K/G504K/E507K/ 
H784AV518L(M2-06) 


76.7 


240% 


201% 


355.87 


2038.19 

1 



RN A activity table 

IrTl %Tth %Taq4M 



TaqDNRXHT 
W417L/G418K/ 
E507KyH784A/G499R/ 
A502KyK504N/ (Ml-13) 


2.55 


287% 


96% 


TaqDNRXHT 
W417L/G418K/ 
/H784A/L500I/Q507H 

A502K/G504K(Ml-36) 


2.71 


304% 


102% 


TaqDNRXHT 


4.43 


498% 


167% 
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W417UG418K/G499R/ 
A502K/G504K/E507K/ 
H784A/T514S (M2-24) 








Taq DN RX HT 
W417L/G418K/G499R/ 
A502K/G504K/E507K/ 
H784AA^518L (M2-06) 


3.56 


400% 


134% 



TABLES: Chimeric mutants 



5 A. DNA activity table 





IdT2 


%TthAKK 


HP 


X 


TthAKK 


34.18 


100% 


5 


393 


Taq 4M G504K 


40.19 


105% 


28 


1991 


Tfi DN 2M 


36.60 


106% 


289 


1326 


Tsc DN 2M 


25.49 


75% 


283 


2573 


TaqTthAKK 


63.89 


187% 


32 


1658 


TtbTaq4MG504K 


25.03 


73% 


8 


627 


TfiTthAKK 


34.13 


100% 


15 


459 


TscTthAKK 


35.23 


103% 


29 


2703 


TfiTaq4MG504K 


35.69 


104% 


37 


872 


TscTaq 4M 


30.04 


88% 


25 


2008 


G504K 











10 B. RN A activity table 

IrT3 %TthAKK 
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TthAKK 


2.27 


100% 


Taq4MG504K 


2.31 


102% 


Tfi DN 2M 


0.20 


9% 


TscDN2M 


0.29 


13% 


TaqTthAKK 


6.81 


300% 


TthTaq 4M 
G504K 


1.09 


48% 


TfiTthAKK 


1.24 


55% 


TscTthAKK 


9.65 


4.25% 


TfiTaq 4M G504K 


1.05 


46% 


TscTaq 4M 
G504K 


2.95 


130% 



In Figures 47 and 49, and throughout the specification, the terms "Z21", "Z28" and "EQ" 
refer to ECLIPSE Quencher dye. "Red" and "RR" denote the REDMOND RED dye. 
5 "Epoch Yellow" refers to YAKIMA YELLOW dye. All of these dyes and quenchers are 
available from Synthetic Genetics, San Diego, CA. 

All pubUcations and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described 

10 methods and systems of the invention will be apparent to those skilled in the art without 
departing from the scope and spirit of the invention. Although the invention has been 
described in connection with specific preferred embodiments, it should be understood 
that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for cairying out the invention 

1 5 which are obvious to those skilled in the relevant fields are intended to be within the 
scope of the following claims. 
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